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Preface for Volume 2 


On April 4, 2014, we celebrated Cora Sadosky’s life with an afternoon in her honor, 
preceded by the 13th New Mexico Analysis Seminar! on April 3-4, 2014, and 
followed by the Western Sectional Meeting of the AMS on April 5-6, 2014, all 
held in Albuquerque, New Mexico, USA. It was a mathematical feast, gathering 
more than a hundred analysts — fledgling, junior and senior — from all over the 
USA and the world such as Canada, India, Mexico, Sweden, the UK, South Korea, 
Brazil, Israel, Hungary, Finland, Australia, Venezuela, and Spain, to remember her 
outspokenness, her uncompromising ways, her sharp sense of humor, her erudition, 
and above all her profound love for mathematics. 

Many speakers talked about how their mathematical lives were influenced by 
Cora’s magnetic personality and her mentoring early in their careers and as they 
grew into independent mathematicians. Particularly felt was her influence among 
young Argentinian and Venezuelan mathematicians. Rodolfo Torres, in a splendid 
lecture about Cora and her mathematics, transported us through the years from 
Buenos Aires to Chicago back to Buenos Aires, from Caracas to the USA back 
to Buenos Aires, and from Washington D.C. to California. He reminded us of Cora 
always standing up for human rights, Cora president of the Association for Women 
in Mathematics (AWM), and Cora always encouraging and fighting for what she 
thought was right. 


'The 13th New Mexico Analysis Seminar and An Afternoon in Honor of Cora Sadosky were 
sponsored by National Science Foundation (NSF) Grant DMS-140042, the Simons Foundation, 
and the Efroymson Foundation, and the events were done in cooperation with the Association for 
Women in Mathematics (AWM). See the conferences websites: 

www.math.unm.edu/conferences/13thAnalysis 

people.math.umass.edu/~nahmod/CoraSadosky.html 

An Afternoon in Honor of Cora Sadosky was organized by Andrea Nahmod, Cristina Pereyra, 

and Wilfredo Urbina. The 13th New Mexico Analysis Seminar organizers were Matt Blair, Cristina 
Pereyra, Anna Skripka, and Maxim Zinchenko from the University of New Mexico and Nick 
Michalowski from New Mexico State University. 
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Cora was born in Buenos Aires, Argentina, on May 23, 1940, and died on 
December 3, 2010, in Long Beach, CA. Cora got her PhD in 1965 at the University 
of Chicago under the supervision of both Alberto Calderén and Anthoni Zygmund, 
the grandparents of the now-known Calder6én-Zygmund School. Shortly after her 
return from Chicago, she married Daniel J. Goldstein, her lifelong companion 
who sadly passed away on March 13, 2014, a few weeks before the Albuquerque 
gathering. Daniel and Cora had a daughter, Cora Sol, who is now a political science 
professor at California State University in Long Beach, and a granddaughter, Sasha 
Malena, who brightened their last years. During her life, Cora wrote more than 
50 research papers, a graduate textbook (Interpolation of Operators and Singular 
Integrals: An Introduction to Harmonic Analysis, Marcel Dekker 1979), and she 
edited two volumes: one celebrating Mischa Cotlar’s 70th birthday (Analysis and 
Partial Differential Equations: A Collection of Papers Dedicated to Mischa Cotlar, 
CRC Press, 1989) and one celebrating Alberto Calder6n’s 75th birthday (Harmonic 
Analysis and Partial Differential Equations: Essays in Honor of Alberto Calderon, 
edited with M. Christ and C. Kenig, The University of Chicago Press, 1999). In the 
first volume, we have included a list as complete as possible of her scholarly work. 
Notable are her contributions to harmonic analysis and operator theory, in particular 
her lifelong and very fruitful collaboration with Mischa Cotlar. 

When news of Cora’s passing spread like wildfire in December 2010, many 
people were struck. The mathematical community quickly reacted. The AWM 
organized an impromptu memorial at the 2011 Joint Mathematical Meeting (JMM), 
as reported by Jill Pipher, at the time AWM president: 


Many people wrote to express their sadness and to send remembrances. The AWM business 
meeting on Thursday, January 6 at the 2011 JMM was largely devoted to a remembrance of 
Cora. 


This appeared in the March-April issue of the AWM Newsletter” which was entirely 
dedicated to the memory of Cora Sadosky. 

An obituary by Allyn Jackson for Cora Sadosky appeared in Notices of the 
American Mathematical Society in April 2011.° 

In June 2011, Cathy O’Neal wrote in her blog mathbabe* a beautiful remem- 
brance for Cora: 


[...] Cora, whom I met when I was 21, was the person that made me realize there is a 
community of women mathematicians, and that I was also welcome to that world. [...] 
And I felt honored to have met Cora, whose obvious passion for mathematics was absolutely 
awe-inspiring. She was the person who first explained to me that, as women mathematicians, 
we will keep growing, keep writing, and keep getting better at math as we grow older [...]. 
When I googled her this morning, I found out she’d died about 6 months ago. You can read 


2 President’s Report, AWM Newsletter, Vol. 41, No. 2, March-April 2011, p. 1. This issue was 
dedicated to the memory of Cora Sadosky and it was partially reproduced in Volume 1. 


3Notices AMS, Vol. 58, Number 4, April 2011, pp. 613-614. 
“http://mathbabe.org/201 1/06/29/cora-sadosky/ 
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about her difficult and inspiring mathematical career in this biography.° It made me cry and 
made me think about how much the world needs role models like Cora. 


In 2013, the Association for Women in Mathematics established the biennial 
AWM-Sadosky Prize in Analysis,° to be awarded every other year starting in 2014. 
The purpose of the award is to highlight exceptional research in analysis by a woman 
early in her career. Svitlana Mayboroda was the first recipient of the AWM-Sadosky 
Research Prize in Analysis awarded in January 2014. Mayboroda contributed a 
survey paper joint with Ariel Barton to the first of this series of two volumes. As 
the first volume went into press, the second recipient of the award, the 2016 AWM- 
Sadosky Prize, was announced: Daniela de Silva, from Columbia University. The 
award was presented to her in the January 2016 Joint Mathematical Meeting. 

In 2015, Kristin Lauter, president of the AWM, started her report in the May-June 
issue of the AWM Newsletter,’ with a couple of paragraphs remembering Cora: 


I remember very clearly the day I met Cora Sadosky at an AWM event shortly after I got my 
PhD, and, knowing very little about me, she said unabashedly that she didn’t see any reason 
that I should not be a professor at Harvard someday. I remember being shocked by this idea, 
and pleased that anyone would express such confidence in my potential, and impressed at 
the audacity of her ideas and confidence of her convictions. 

Now I know how she felt: when I see the incredibly talented and passionate young female 
researchers in my field of mathematics, I think to myself that there is no reason on this earth 
that some of them should not be professors at Harvard. But we are not there yet ...and 
there still remain many barriers to the advancement and equal treatment of women in our 
profession and much work to be done. 


In these two volumes, friends, colleagues, and/or mentees have contributed 
research papers, surveys, and/or short remembrances about Cora. The remem- 
brances were sometimes weaved into the article submitted (either at the beginning or 
the end), and we have respected the format each author chose. Many of the authors 
gave talks in the 13th New Mexico Analysis Seminar, in An Afternoon in Honor of 
Cora Sadosky, and/or in the Special Sessions of the AMS; others could not attend 
these events but did not think twice when given the opportunity to contribute to this 
homage. 

The mathematical contributions naturally align with Cora’s mathematical inter- 
ests: harmonic analysis and PDEs, weighted norm inequalities, Banach spaces and 
BMO, operator theory, complex analysis, and classical Fourier theory. 

Volume 1 contains articles about Cora and her mathematics and mentorship, 
remembrances by colleagues and friends, her bibliography according to Math- 
SciNet, and survey and research articles on harmonic analysis and partial differential 
equations, BMO, Banach and metric spaces, and complex and classical Fourier 
analysis. 


>Biographies of Women in Mathematics: Cora Sadosky http://www.agnesscott.edu/lriddle/women/ 
corasadosky.htm 


®More details in the AWM-Sadosky Research Prize in Analysis webpage: https://sites.g0ogle.com/ 
site/awmmath/programs/sadosky-prize. 


7President’s Report. AWM Newsletter, Vol. 45, No. 3, May-June, p. 1 (2015). 
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The year 2014 saw the resolution of the two-weight problem for the Hilbert 
transform 4 la Muckenhoupt by Michael Lacey, Erik Sawyer, Chun-Yen Shen, and 
Ignacio Uriarte-Tuero, a problem that had been open for 40 years. This problem 
was solved a la Helson-Szegé by Cora Sadosky and Mischa Cotlar in the early 
1980s using complex analysis and operator theory methods. In the last 15 years, 
a number of techniques have been developed and refined to yield this result, 
including stopping time arguments, Bellman functions, Lerner’s median approach, 
and bumped approach. 

Volume 2 contains more remembrances and survey and research articles on 
weighted norm inequalities, operator theory, complex analysis, dynamical systems, 
and dyadic harmonic analysis. The articles illustrate surprising connections to 
Tauberian functions, number theory, and wavelet systems. A survey of the two- 
weight problem for the Hilbert transform by Michael Lacey is featured. 

Before describing in detail the contents of the second volume in this series, we 
would like to end with some words by Nikolai Nikolski® regarding Cora Sadosky’s 
joint work with Mischa Cotlar on the two-weight problem: 


My next impression on Mischa’s mathematics is dated about 10 years later when his great 
series of papers with Cora Sadosky on Generalized Toeplitz Kernels (GTK) started to 
appear. On the age when all people involved in “weighted analysis” were excited with the 
Muckenhoupt-type approach (which is efficient for real variable applications), the Cotlar- 
Sadosky’s idea to develop Helson-Szegé classical techniques was revolutionary. They 
immediately obtained important applications of the GTK theory in a variety of domains 
where complex analysis language is more appropriate than the real analysis one (scattering 
theory, Hankel and Toeplitz operators, dilation theory..., but also singular integrals 
for so important problems as the famous two-weighted estimates). This Cotlar-Sadosky 
series appeared almost simultaneously with the well-known Krein’s school achievements 
(Adamyan-Arov-Krein) and the Lax-Phillips approach to scattering theory, but the GTK 
theory showed several advantages (as, for example, an important - and growing with time! 
- efficiency in several complex variables.) 


Contents of Volume 2 


We now describe in more detail the contents of the second volume. Volume 2 
includes remembrances, photos, two survey articles, and research articles by an 
array of mathematicians representing themes at the heart of Cora’s mathematical 
interests: weighted inequalities, complex analysis, and operator theory. 

In the chapter “Remembering Corita”’, author and poet Margaret Randall, 
longtime friend of Corita and her parents, Manuel and Cora, shares her memories of 
the family, and in chapter “Remembering Cora” Neil Hindman, Howard University 
Cora’s former colleague, shares a few memories. 


8This is a paragraph in a remembrance for Mischa Cotlar that can be found at www.math.unm.edu/ 
conferences/10thAnalysis/resources/cotlar/nikolski.pdf. 
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In the chapter “The Two-Weight Inequality for the Hilbert Transform: A Primer,” 
Michael Lacey surveys the resolution of the two-weight problem for the Hilbert 
transform. This is a valuable and insightful survey on the recent advances on the 
two-weight inequality for the Hilbert transform, where the author has been a leading 
contributor. With the fast development of the field in the last few years, hundreds 
of pages have been published. While much of that has quickly become outdated 
with the arrival of more powerful and more efficient approaches, there are still 
important parts in the non-latest papers that have not been redone or surpassed by 
the newest developments. In this survey article, there is a detailed presentation of 
the unconditional characterization of the two-weight inequality by Lacey, Sawyer, 
Shen, and Uriarte-Tuero, incorporating at the core a refinement by Hyt6énen. An 
earlier conditional result of Nazarov, Treil, and Volberg, under a “pivotal” condition, 
is reworked in the style of the more recent papers for the sake of both simplification 
and easier comparison. Some important counterexamples and a thorough discussion 
of motivation and applications are presented. Michael Lacey reflects in his article 
about Cora Sadosky and how his research was strongly influenced by her passion 
and interests, he gave a talk titled Cora Sadosky Influence on my Work in an AMS 
Special Session on “Harmonic Analysis and Operator Theory (In Memory of Cora 
Sadosky)” co-organized by two of the editors of this volume, Stokolos and Urbina, 
in Albuquerque on April 2014. 

In the chapter “Singular Integrals, Rank-One Perturbations, and Clark Model 
in General Situation,’ Constance Liaw and Sergei Treil present a survey and 
discuss generalizations of the Clark model to the case of non-singular measures 
and applications to the study of rank-one perturbations for unitary and self-adjoint 
operators. This survey summarizes several well-known papers in that direction 
written previously by the authors and gives some new ideas on the construction of 
a similar model for dissipative operators. Rank-one perturbations play an important 
role in operator theory and mathematical physics. One of the principal attractions of 
rank-one perturbations is that for such operators almost everything can be explicitly 
computed, and then advanced techniques of harmonic analysis, like the study of 
fine properties of Cauchy-type integrals or advanced theory of singular integral 
operators, can be applied. These lecture notes give an account of the mini-course 
delivered by the authors in the 13th New Mexico Analysis Seminar in April 2014 
on Perturbations, Two-Weight Estimates, and Clark Model. Sergei Treil also gave an 
invited lecture on Two-Weight Estimates Following Arocena-Cotlar-Sadosky during 
“An Afternoon in Honor to Cora Sadosky” held in Albuquerque, NM, in April 2014. 

In the chapter “On Two-Weight Estimates for Dyadic Operators,’ Oleksandra 
Beznosova, Daewon Chung, Jean Moraes, and Maria Cristina Pereyra discuss 
quantitative two-weight estimates for dyadic operators in a nonhomogeneous 
setting. They review the prior known estimates for the maximal dyadic function, 
dyadic square function, martingale transform, and the dyadic paraproduct. They 
compare their results for the dyadic paraproduct to results of Holmes, Lacey, and 
Wick on a homogeneous setting where the two weights are assumed to be in the 
Muckenhoupt A, class and the Bloom BMO property is necessary and sufficient for 
boundedness of the dyadic paraproduct and its adjoint. In this paper, the weights 
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are not necessarily doubling, they satisfy a joint A condition, and the dyadic square 
function is assumed to be two-weight bounded. 

In the chapter “Potential Operators with Mixed Homogeneity,’ Calixto Calderon 
and Wilfredo Urbina present a fitting tribute to Corita (in the reviewer’s words). In 
1966 Cora Sadosky discussed a quasi-homogeneous version of Sobolev’s immersion 
theorem. Later the first author and T. Kwembe proved a similar result for potential 
operators with kernels having mixed homogeneity very much in the spirit of 
Sadosky’s result. The aim of this paper is to extend Calder6n-Kwembe’s theorem in 
two directions: first by establishing a corresponding result in terms of mixed norms 
in the Benedek-Panzone’s sense and second by obtaining results for the case of 
unbounded characteristics. 

In the chapter “Elementary Proofs of One-Weight Norm Inequalities for Frac- 
tional Integral Operators and Commutators,” David Cruz-Uribe presents new proofs 
of some recent results concerning weighted inequalities for the fractional integral 
operator and its commutator with BMO functions. The author reduces the problem 
to obtaining estimates for a sparse fractional operator which majorizes the fractional 
integral operators. As pointed out by the author, the advantage of this approach 
is its simplicity: it avoids extrapolation, good-A inequalities, and comparisons to 
the fractional maximal operator; however the proofs do not give sharp dependence 
on the A, characteristic of the weights; nevertheless this dependence is carefully 
tracked. This is a nice summary and introduction into the modern dyadic techniques 
in weighted inequalities. This chapter should be read in conjunction with the chapter 
about two-weight inequalities for fractional integral operators by Sawyer, Shen, and 
Uriarte-Tuero in this volume. Cruz-Uribe ends with a very touching personal story 
about Cora Sadosky. 

In the chapter “Finding Cycles in Nonlinear Autonomous Discrete Dynamical 
Systems,’ Dmitriy Dmitrishin, Anna Khamitova, Alex Stokolos, and Michai 
Tohaneanu provide an exposition of their recent results concerning cycle 
localization and stabilization in nonlinear dynamical systems. Both the general 
theory and numerical applications to well-known dynamical systems are presented. 
Following on the footsteps of the pioneering work of Grebogi, Ott, Pyragas, York, 
et al., the authors consider associating to a given map f on R” and for each N another 
map Fy on (R")" that will stabilize certain unstable orbits common with the initial 
map f for appropriately chosen parameters. The authors show for a fixed N not all 
unstable orbits may be stabilized by the given control and indicate explicit bounds 
on the multiplier of a T-cycle of f that enable control of the above form to stabilize 
the orbit. This paper will be of primary interest to those with prior experience in 
dynamical systems; fortunately the authors provide a very suitable list of references 
for those who wish to study the prerequisites necessary for a good understanding of 
this paper. Alex Stokolos gave an invited talk on Complex and Harmonic Analysis 
in Nonlinear Dynamics during “An Afternoon in Honor of Cora Sadosky” held 
in Albuquerque, NM, in April 2014. Stokolos ends their article with a heartfelt 
remembrance of Cora Sadosky. 

In the chapter “Smooth Analytic Functions and Model Subspaces,” Konstantin 
Dyakonov surveys the canonical Riesz-Nevanlinna factorization in various classes 
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of analytic functions on the disk that are smooth up to its boundary and model 
subspaces (i.e., invariant subspaces of the backward shift) in the Hardy spaces H? 
and in BMOA. It is the interrelationship and a peculiar cross-fertilization between 
these two topics that the author wishes to highlight. This article deals with the 
canonical factorization in classes of “smooth” analytic functions on the unit disk 
on one hand and the so-called model subspaces on the other hand. The author gives 
a (almost) self-contained presentation (with proofs) of several deep and beautiful 
results which are related in a natural way to the theory of Hankel and Toeplitz 
operators, one of Cora Sadosky’s favorite themes. 

In the chapter “Rational Inner Functions on a Square-Matrix Polyball,”’ Annatoli 
Grinshpam, Dmitry Kaliuzhnyi-Verbovetskyi, Victor Vinnikov, and Hugo Woerde- 
man establish among other results the existence of a finite-dimensional unitary 
realization for every matrix-valued rational inner function from the Schur-Agler 
class on a unit square-matrix polyball. It is well known that for polydisks, the 
Schur-Agler class and the Schur class only coincide in dimensions 1 and 2. 
Furthermore, in the other cases, though one might naively expect otherwise, not 
all rational inner functions are in the Schur-Agler class. As a consequence, it is of 
interest to characterize those which are. The authors describe those matrix-valued 
rational inner functions in the Schur-Agler class over square-matrix polyballs, which 
includes the polydisks. This is done in terms of the transfer function representation, 
which, as might be expected, coincides with the unitary colligation being finite 
dimensional. The last section of the paper attempts in the scalar case to more directly 
describe the polynomials appearing the Koranyi and Vagi description when dealing 
with Schur-Agler class rational inner functions. This connects with much interesting 
recent work, including by the authors, on determinantal representations. Finally, a 
number of thought-provoking open questions are posed. 

In the chapter “A Note on Local Hélder Continuity of Weighted Tauberian 
Functions,” Paul Hagelstein and Ioanis Parissis discuss how Solyanik estimates may 
be used to establish local Hélder continuity estimates for the Tauberian functions 
associated to the Hardy-Littlewood and strong maximal operators in the context 
of Muckenhoupt weights. The Tauberian condition for the geometric maximal 
operators was introduced by A. Cérdoba and R. Fefferman in 1977. In this nice and 
clearly written article, the authors introduce the Tauberian function as a weighted 
generalization of the halo function, which is a classical object in the theory of 
differentiation of integrals, and they establish belonging of the function to the local 
Holder classes, with the Hélder exponent proportional to the reciprocal value of the 
Ago norm of the weight. The main tool used is Solyanik estimates — a series of results 
initiated by A. Solyanik in 1995 and developed further by the authors of the current 
article. In particular, they proved that Solyanik estimates imply the continuity of 
halo function for the most important case of density bases. The main result of the 
article provides the quantitative version of continuity of the Tauberian function for 
bases of all cubes and all rectangles with sides parallel to the coordinate axis. The 
novelty of the article is in the remarkable connection of the Ag, norm of the weights 
with the smoothness of the Tauberian functions. 
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In the chapter “Three Observations on Commutators of Singular Integral Oper- 
ators with BMO Functions,’ Carlos Pérez and Ismael Rivera present interesting 
observations concerning commutators of singular integral operators with BMO 
functions. Namely, they discuss sharpness of the sub-exponential local decay, 
sparse domination result for commutators, and the failure of an endpoint estimate 
motivated by the conjugation method. This paper is very useful not only for the 
researcher who is carefully studying the commutator but also for the beginners 
who want to be acquainted with the theory of commutators. Carlos Pérez gave an 
invited talk on Optimality of Exponents and Yano’s Condition in Weighted Estimates 
and Endpoint Estimates during “An Afternoon in Honor of Cora Sadosky” held in 
Albuquerque, NM, in April 2014. 

In the chapter “A Two-Weight Fractional Singular Integral Theorem with Side 
Conditions, Energy, and k-Energy Dispersed,” Erik Sawyer, Chun-Yen Shen, and 
Ignacio Uriarte-Tuero present a follow-up to a paper of theirs, where the two-weight 
inequality for fractional singular integral operators is studied under the assumption 
that the pair of weights does not have common point masses. In this chapter, the 
authors allow for common point masses. Under appropriate Muckenhoupt (joint A? 
and punctured A? conditions) and a-quasi-energy side conditions, the authors show 
that a fractional singular integral operator, T,,, is bounded from one weighted space 
to another if certain quasicube testing conditions (involving a globally bi-Lipschitz 
map Q : R” — R") hold for 7, and its dual and if the quasiweak boundedness 
property holds for T,,. Conversely, if TJ, is bounded, then the quasitesting conditions 
hold, and the quasiweak boundedness condition holds. It is unknown whether the 
quasi-energy conditions are necessary in higher dimensions in general. This is a 
highly technical paper and the authors do a very good job placing road maps and 
diagrams and sometimes iterating ideas so the reader does not get lost. 

In the chapter “A Partition Function Connected with the Gdllnitz-Gordon 
Identities,’ Nicolas A. Smoot presents a Rademacher-type formula for the partition 
of a positive integer into parts of special type, associated to the Géllnitz-Gordon 
identities. The subject of this paper constitutes a beautiful application of complex 
analysis to number theory. The proof of the main result follows in the spirit of the 
Rademacher approach and involves its basic components — generating functions, 
Rademacher contours, the Hardy-Ramanujan circle method, Bessel functions. The 
argument is quite involved but very well written and illustrated by nice sketches. 
The paper is almost self-contained and is accessible to nonexpert researchers and 
graduate students. 

In the chapter “On Toeplitz Operators with Quasi-radial and Pseudo- 
homogeneous Symbols,” Nikolai Vassilevskii explores a new wide class of symbols 
that generate commutative Banach algebras on each weighted Bergman space 
on the unit ball in C”. These symbols are a natural extension of the previously 
studied quasi-radial quasi-homogeneous symbols and contain them as a very special 
particular case. The commutative C*-algebras of Toeplitz operators on Bergman 
spaces of the unit ball B” of C” are fairly well understood nowadays. Some examples 
of commutative Banach (not C*) algebras of such operators are known. The current 
paper exhibits a new wide class of functions that generate such algebras on each 
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of the standard weighted Bergman spaces on B”. The results are of interest and 
contribute toward the (as yet fairly incomplete) understanding of the commutative 
subalgebras of Toeplitz operators on the ball in C”, in dimension larger than one. 

In the chapter “A Bump Theorem for Weighted Embedding and Maximal 
Operator: The Bellman Function Approach,” Sasha Volberg gives a simple Bellman 
function proof of Carlos Pérez’s “bump theorem” for the two-weight estimates of 
maximal operators. The author shows how the Bellman function method proves a 
certain “discrete” inequality (i.e., a bound on discrete operator), which, in its turn, 
implies a bound of a certain continuous operator. The main difference from a, by 
now, classical procedure is that here one needs a Bellman “functional.” Since the 
operator under consideration (maximal function) is, in some sense, easy, the paper 
is intended to give a clear and not very technical presentation of the method. 

In the chapter “The Necessity of Ago for Translation and Scale Invariant Almost- 
Orthogonality,’ Mike Wilson continues his study of general wavelet systems in the 
context of weights. For a weight jz in the Muckenhoupt class Ag, the author has 
shown previously that for sufficiently nice mother wavelets and arbitrary T-systems, 
the resulting system is almost orthogonal in L?(jz). This paper concerns the converse 
result. In the reviewer’s own words: The current version of the converse seems 
now fully satisfactory. It has the nice feature then that if you can obtain the 
almost orthogonality in L’(dt) for one choice of a wavelet meeting the minimum 
requirements and all T-systems, then one can conclude that [t € Ago, and then that 
one has the almost orthogonality in L?() for all reasonable wavelets systems and 
all T-systems, by the other direction of the theorem. This “prove it for one, get it for 
all” feature, although not uncommon in this general area, is quite pleasing. 

Alex Stokolos, Sergei Treil, and Carlos Pérez were invited speakers to “An 
Afternoon in Honor of Cora Sadosky.” Beznosova, Cruz-Uribe, and Pereyra and 
Stokolos and Urbina were co-organizers of AMS Special Sessions on “Weighted 
Norm Inequalities and Related Topics” and on “Harmonic Analysis and Operator 
Theory (In Memory of Cora Sadosky),” respectively, on April 5-6, 2014, held 
in Albuquerque, NM. Michael Lacey, Oleksandra Beznosova, Daewon Chung, 
Jean Moraes, Paul Hagelstein, Dmitry S. Kaliuzhnyi-Verbovetskyi, Constance 
Liaw, Nikolai Vassilevskii, and Mike Wilson gave talks in the AMS meeting in 
Albuquerque in April and honored there as they are doing here the life and work of 
Cora Sadosky. Vinnikov is Cora Sadosky’s coauthor; in fact they were both co- 
authors of the abstract for the AMS talk delivered by Kaliuzhnyi-Verbovetskyi. 
Other authors could not make it to the conference but were more than happy to 
contribute to this volume. 
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Part I 
More Remembrances and photos 


Remembering Corita... 


Margaret Randall 


Corita Sadosky—her mother’s name was Cora, so we called her Corita—was an 
Argentinean mathematician who belonged to her country’s “lost generation.” What 
that meant was that she came of age during the 1970s, dark years of dictatorship 
when 30,000 young people were murdered by neo-fascist forces much like those 
once again threatening us in so many countries today. Some because they were 
actively working against the dictatorships that had taken control in the Southern 
Cone of South America. Some because they were simply young, and youth alone 
was considered a crime. Most for some combination of both. 

I first met Corita and her parents, Cora and Manuel, when we had all taken refuge 
in Cuba. Corita and her family, including her husband Daniel Goldstein, had come to 
the Caribbean island by way of Venezuela. Eventually they had a daughter, Corasol. 
My family and I had come from Mexico. Those years saw interwoven webs of exiles 
moving from country to country, often just a few steps ahead of death threats or 
worse. Corita and I became instant friends. 

In the summer of 1973 it seemed our liberated territories had expanded; Chile, 
under Salvador Allende was trying to show the world that freedom could come 
not only by means of armed struggle but also through democratic elections. People 
throughout the Americas were hopeful. In Peru the Velasco Alvarado government 
was developing progressive initiatives, especially for the country’s large Quechua 
population. Corita was invited to head a program there, to improve women’s lives. 
The program involved an oral history project, in which they would ask women 
themselves what they needed. 

Looking back, I can’t help but smile. Cora had been invited because she was 
a woman, without consideration of her life as a mathematician. She promptly told 
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the Peruvian organization issuing the invitation that she was not the best person 
for the job. She recommended me—already a feminist and oral historian—and 
that is how I happened to travel to Peru in the fall of 1973, just after Pinochet’s 
bloody coup toppled the Chilean experiment. Hundreds of Chileans were making 
their way across their country’s northern border into Peru. I would spend 3 months 
working for the United Nations’ International Labor Office in what turned out to be 
a fascinating project. Unfortunately, it too would be doomed. 

From then on, Corita’s and my friendship grew. We spent time together in Cuba 
and later in Washington, D.C., where she eventually ended up, teaching at Howard 
University. Our families, too, remained interwoven—in that strong fabric made up 
of Latin American revolutionaries forced to migrate from country to country. I 
remember Cora’s sudden death, and a visit to Manuel and his new wife in Buenos 
Aires. My son, Gregory, at the time vice president of the University of the Republic 
in Uruguay, bestowed an honorary doctorate on Manuel just months before the 
latter died. Corita made a life for herself in the United States. Despite having been 
displaced from the country of her birth, she adapted to Washington, taught several 
generations of upcoming mathematicians, and continued with the research that had 
become her life’s passion. My last memories of Corita are from a visit Barbara and 
I made to her in our nation’s capital. 

When Corita died, it was a shock. We hadn’t been in touch for a few years, but I 
had no doubt she was there—in that “there” we shared for so many years although its 
physical location might vary. Like many, she died much too young. But, like some, 
she had had the strength to overcome a violent displacement that has defeated so 
many. 

It is wonderful that she is being honored in this beautiful book. She was a good 
friend. But she was also an example of the humanity, brilliance and fortitude needed 
again today to keep on fighting the good fight against the powers of disrespect and 
arrogance so obvious on today’s political scene. 

October 2016 


Margaret Randall (New York, 1936) is a poet, essayist, oral historian, translator, photog- 
rapher and social activist. She lived in Latin America for 23 years (in Mexico, Cuba, and 
Nicaragua). Randall’s most recent titles include “She becomes time”, “Che on my mind”, 
“Haydée Santamaria, Cuban revolutionary: she led by transgression”, and “Exporting 
revolution: Cuba’s global solidarity”. Randall has also devoted herself to translation, 
producing “Only the road/Solo el camino”, an anthology of eight decades of Cuban poetry, 
among other books. She lives in New Mexico with her partner (now wife) of more than 30 
years, the painter Barbara Byers, and travels extensively to read, lecture and teach. 


Remembering Cora 


Neil Hindman 


I spent almost three decades in an office adjacent to Cora’s office. We were in 
a building separate from the department office, where most of the Mathematics 
Department faculty had their offices. We were fortunate, since we had individual 
offices, while those in the other building had two or three people to an office. We 
were unfortunate because the university doesn’t tend to turn on the heat in our 
building until mid November. We both had electric space heaters, and when we 
ran them at the same time, we would blow a circuit breaker. 

My main interaction with Cora was the fact that we shared my coffee maker. 
Every morning when she came in to the office, she would come and get a cup of 
coffee—we both drank it black. And she would occasionally bring in a can of ground 
coffee. 

Beyond that, the main thing I remember is that Cora did not like to carry the 
heavy calculus text book to class. So she would tear out the relevant pages for the 
day’s lesson and just take those to class. 
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Dinner with MSRI’s Human Resources Advisory Committee, November 10, 2004, at the house of 
Director David Eisenbud. Photo by Monika Eisenbud 


1996 University of Chicago Conference in honor of Alberto Calderén’s 75" Birthday: Front 
row, seated (left to right): M. Christ, C. Sadosky, A.P. Calderon, M.A. Muschietti. First row, 
standing (left to right): C.E. Kenig, J. Alvarez Alonso, C. Gutierrez, E. Berkson, J. Neuwirth. 
Second row, standing (left to right): A. Torchinsky, J. Polking, S. Vagi, R.R. Reitano, A.E. Gatto, 


R. Seeley. 
Photo courtesy of Cora Sol Goldstein, photographer unknown 


Part II 
Survey Articles 


The Two Weight Inequality for the Hilbert 
Transform: A Primer 


Michael T. Lacey 


In memory of my father, H. Elton Lacey 


Abstract Given a pair of weights w,o, the two weight inequality for the Hilbert 
transform is of the form ||H(of)|l22~) S [lf llzz(0). Recent work of Lacey-Sawyer- 
Shen-Uriarte-Tuero and Lacey have established a conjecture of Nazarov-Treil- 
Volberg, giving a real-variable characterization of which pairs of weights this 
inequality holds, provided the pair of weights do not share a common point mass. 
In this paper, the characterization is proved, collecting details from across several 
papers; counterexamples are detailed; and areas of application are indicated. 


Introduction 


By a weight we mean a non-negative Borel locally finite measure, typically on R. 
We consider the two weight inequality for the Hilbert transform for a pair of weights 
w,o onR: 


lH (Fo) |e < Nifllee@: (1) 


Here, N denotes the best constant in the inequality. And Hv(x) is the Hilbert 
transform of v 
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d 
Huta) = f = 2) 


We do not insist on the existence of the principal value, a point addressed in 
section “Principal Values”. 

The central question is then a real-variable characterization of the inequality (1). 
In the special case that the pair of weights o and w do not share a common point 
mass, this was supplied in three papers, one of Lacey-Sawyer-Shen-Uriarte-Tuero 
[23] with the refinement of Hyt6nen [15], and another of the present author [17], 
answering a beautiful conjecture of Nazarov-Treil-Volberg [56]. 


Theorem 1.1 Define two positive constants Az and J as the best constants in the 
inequalities below, uniform over intervals I, and with respect to interchanging the 
roles of o and w. 


I 
ae - P(wI\.l) < Ad, @) 
[Hoy av =< Po, (4) 
I 


There holds N ~ H := ane +7. 


The first condition is an extension of the Muckenhoupt A» condition to a ‘half 
Poisson condition with a hole.’ The exact Poisson extension of o to the upper half- 
plane is not needed, rather we use the approximation below, which is roughly the 
Poisson extension evaluated at the center of J, and up into the half-plane the length 
of J, see Fig. 1. 


[Z| 


PO.D= | ayaa? 


o (dx). (5) 


Po(x1, |I|) > Pio, I) 


x 
I 


Fig. 1 The value of P(o, J) is approximately the Poisson extension of o evaluated at point in the 
upper half-plane given by the center of J, and the length of J 
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The remaining conditions are referred to as the Sawyer-type testing conditions, 
as Eric Sawyer first introduced these conditions into the two weight setting in 
his fundamental papers on the maximal function [52], and later the fractional 
and Poisson integral operators [53]. It is well-known that the Az condition (3) is 
necessary for the two weight inequality, and it is obvious that the testing conditions 
are necessary. Thus, the substance of the Theorem above concerns the sufficiency 
of the A and testing inequalities for the norm inequality. 

This Theorem is a central result in the non-homogeneous harmonic analysis, 
as founded in a sequence of influential papers of Nazarov-Treil-Volberg [35-37]. 
The proof of the theorem is involved, encompassing arguments and points of view 
that were spread across several papers [17, 21, 23, 38]. Finally, the interest in the 
two weight inequality is well-motivated by applications to operator theory, model 
spaces, and spectral theory, themselves spread across additional papers. 

The point of this paper is to 


(a) state and prove the Theorem, in all detail. 

(b) give the proof under the influential pivotal condition, which serves to highlight 
where the difficulties arise in the general case; 

(c) collect relevant, explicit, counterexamples; 

(d) give complements and extensions of the theorem, and the proof techniques; 

(e) and point to areas of applications. 


Sections proceed directly towards proofs, but many conclude with some context and 
discussion. The proof is entirely elementary, assuming only the well known facts 
about martingale differences. 


An Overview of the Proof 


The result is an individual two weight inequality. It characterizes the boundedness 
of the Hilbert transform, and no other operator. Therefore, particular properties of 
this transform must guide the proof. The elementary examples of these are the 
monotonicity principle, Lemma 3.2, valid for all pairs of weights, and then the 
energy inequality, Lemma 3.3, valid under the assumption of interval testing and 
the A» condition. These properties are a last vestige of positivity: The kernel ; is 
monotone increasing on R \ {0}. This feature will deliver to us the energy inequality; 
finding it, and unlocking its secrets is the key to the proof. 

The main line of the argument begins with the bilinear form (H,f, g)y. It’s 
decomposition is made to ‘regularize’ all four quantities in the expression, the two 
functions f and g, as well as the ‘irregularities’ of the pair of weights, as expressed 
by the energy inequality. Only half of the decomposition needs to be specified, due 
to the self-dual nature of the question, and some of these considerations are familiar 
to experts in both the 71 and the 7b theorems. But the underlying difficulties do not 
have any classical analog. 
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Fig. 2 A schematic tree of (Hef, g) 
the proof of the main theorem aise 


——— Az 


Triangular Forms (4.3) 


—______| Global to Local §5 


Local Estimate (4.11) 


——__ testing 


Stopping Form (6.1) + Size Lemma 6.7 


The proof strategy is outlined in Fig.2. The passage to the ‘triangular forms’ 
in Lemma 4.1 is a rather standard step in many T1-type theorems. The Calderén- 
Zygmund stopping data defined in section “Global to Local Reduction” is the 
foundational tool. It (a) controls the values of certain telescoping sums of martingale 
differences; (b) regularizes the weights, from the point of view of the energy 
inequality; and (c) allows the use of the guasi-orthogonality argument, an important 
simplification. The triangular forms are of a ‘local’ and a ‘global’ form, and have 
dual forms as well. There are two steps in the analysis, a ‘global to local’ reduction 
in section “Global to Local Reduction’, and an analysis of the ‘stopping form’ in 
the section “The Stopping Form”. 

The stopping data is essential to the ‘global to local reduction’ in Theorem 4.4. 
A simple appeal to the testing condition, allows an application of the monotonicity 
principle to rephrase the inequality in this Theorem as a certain two-weight 
inequality for the Poisson integral. In this inequality, the Poisson integral maps 
functions on R to those on Ri. The weight on R is, say, o. The weight on Ri is 
then derived from w in a specific fashion from the stopping data, and hence depend 
upon f and the pair of weights. But the Poisson operator is a positive operator, and 
one has a quite adequate understanding of their two weight inequalities. We directly 
implement this understanding, without proving any more general result. 

The local term is then dominated by the analysis of the stopping form (53). 
This is again a familiar object, to experts in T1 theorem, addressed by ad hoc off- 
diagonal estimates, which absolutely do not apply in the current context. Control of 
the irregularities of the weights is now the main point, complicated by the fact that 
the stopping form is not intrinsically defined. A notion of ‘size’ is introduced—it 
serves as an approximate of the operator norm of the stopping form, and again is 
most naturally defined in terms of a measure on R2., derived from the two given 
weights. The size lemma, Lemma 6.3, decomposes a stopping form into constituent 
parts. Those of large size have a simpler form, which allows one to estimate their 
operator norm by size. What is left has smaller size, and so one can recurse. This 
argument relies heavily on the Hilbertian structure of the question. 
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Some readers will have noticed that a very common set of objects, Carleson 
measures, are not mentioned, and indeed, they do not appear in the proof at all. 
The wide spread prevalence of Carleson measures in T1 theorems can be traced to 
two facts, first that associated paraproducts operators are the principle obstacle to a 
simple proof, and second, the paraproduct operators have an essentially canonical 
form. In this theorem, neither of these facts hold, and so we have abandoned the 
notions of Carleson measures and paraproducts. 

Carleson measures are also used to, indirectly, control the sums of martingale 
differences. Rather than this, we use the simpler method of stopping data, as 
described in section “Global to Local Reduction”. 


The A, Theory 


The classical case of an Az weight corresponds to the case of w(dx) = w(x)dx, and 
w(x) > 0 a.e. Moreover, the weight o also has density given by o(x) := w(x)!. It 
is assumed that both w and o are locally integrable, so that they are both weights. 
See Fig. 3. Note that w(x) - o(x) = 1. The Muckenhoupt A> condition asserts that 
this same equality approximately holds, uniformly over location and scale. 


ov) sup UO). 20 
er Wd 


These are ‘simple’ averages. This condition is equivalent to the uniform norm bound 
on L7(w) for the class of simple averaging operators 


1 


for 


fr dx +1), Tis an interval. 
I 


From this condition flows a rich theory, including the boundedness of all Calderén- 
Zygmund operators. The classical result of Hunt-Muckenhoupt- Wheeden [11] states 


Fig. 3 For 0 < € < 1, the function w(x) = |x|!~* is an A> weight. It and the dual weight 
a(x) = |x|*—! are graphed above. One can check that [w]4, ~ «7! 
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that w in A, if and only if the Hilbert transform maps L*(w) to L?(w). By a 
basic change of variables argument, first noted by Sawyer [52], this is equivalent 
to H, mapping L?(c) to L?(w). Stefanie Petermichl [44] quantified the Hunt- 
Muckenhoupt- Wheeden theorem as follows. 


Theorem A A weight w € Az if and only if H is bounded from L?(w) to L?(w), and 
moreover the constant N in (1) satisfies N ~ [w]a,. 


To place this result in the context of our main result, it is classical and easy to see 
that the Poisson Az characteristic satisfies Ar < [w]i,- And, using the remarkable 
Haar shift representation of the Hilbert transform due to Petermichl [43], one can 
check that the testing condition satisfies J < [w]4,. This is what Petermichl’s 
original proof did. All existing proofs of Petermichl’s Theorem (see [13, 18, 29]) 
depend ultimately on known Lebesgue measure estimates for the Hilbert transform, 
or closely related operators. For instance, [16, 29] use the weak-L! (dx) bound for 
sparse shift operators. Estimates of these type are irrelevant for the two weight 
theorem. 

It is perhaps worth emphasizing that the powerful Haar shift technique of 
Petermichl, even with its impressive extension by Hyt6nen [13], seems to be of little 
use in the general two weight problem. There are two obstacles: Firstly, in order to 
use it, one must essentially have control on a Haar shift operator, independently 
of how the grid defining the shift is defined. The resulting condition on the pair 
of weights is more subtle than the two weight inequality for the Hilbert transform. 
Secondly, one should recover the energy inequality of Lemma 3.3. But, the energy 
of any fixed Haar shift is zero, and indeed, the two weight inequality for Haar shift 
operators [39] has just a few difficulties in its proof. 

By the Az Theorem, it is meant the linear in Az bound for all Calderén-Zygmund 
operators. This result, pursued by many, and established by Hyténen [13], has many 
points of contact with the subject of this note. But, we refer the reader to [14] and 
references there in for more information, and see [16] for what is arguably the most 
elementary proof. 

In the A» theory, it is essential that w(x) > 0 a.e. Suppose one relaxes this 
condition to w(x) is positive on a measurable set E € R, and define a(x) to be 
supported on E, and equal to w(x)~!. One can then ask if the Hilbert transform is 
bounded for this pair of weights, and Theorem 1.1 applies here. This question is 
an instance of the non-homogeneous Az theory advocated by A. Volberg. One can 
hope that specificity in the way the weights are prescribed could introduce some 
additional simplifications in the characterization of the two weight inequality in this 
setting. But, none has yet been found. 


The Individual Two Weight Problem 


Given an operator T, the individual L? two weight inequality for T is the inequality 


IITof lori) S Nef llr). (6) 
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Here and throughout we use the notation 7,f := T(of). We understand that T 
applied to a signed measure o - f should make sense. And, the inequality above 
is the preferred form of the inequality as duality is expressed in the natural way: 
The inequality (6) is equivalent to 


Teele = NrIlgllo" (wy: 


The question is then to characterize the pairs of weights for which (6) holds. 

This specificity of the question is of interest for a few canonical operators, ones 
for which the corresponding two weight inequality will naturally present itself. 
The leading examples of this are, for positive operators, the Hardy operator by 
Muckenhoupt [32], the maximal function, Sawyer’s Theorem of 1981 [52] and 
Sawyer’s 1988 theorem for the fractional integrals [53]. It is noteworthy that the 
two weight inequalities for the Hardy and the Poisson integral are used in the proof 
of our main theorem, as are various purely dyadic variants of these Theorems. 

It is interesting to that this is not only a chronological list, but it also reflects the 
depth of the results as well. The Hardy operator is easiest, characterized by an “A2- 
type condition,’ as recalled in Theorem F. It was Sawyer’s insight, however, that 
the maximal function characterization requires a testing condition. The fractional 
integrals are harder still. For the sake of comparison, let us state a special case of 
the result for the fractional integrals in one dimension. Besides Sawyer’s results, one 
should also consult Casscante-Ortega-Verbitsky [6], as well as those of Vuorinen 
[57]. Both results give a characterization in terms of testing conditions. And, while 
we state just one case of the general result, one should note that there is no Sobolev 
condition imposed on the L’ indices. 


Theorem B For two weights w,o, and 0 < a < 1, the operator Rof(x) = 


Sf@- y) maps L?(o) to L?(w) if and only if the testing inequalities below 
hold. , 


/ R,(1))* dw < T’o(D), / R,(1;)? do < T’w(I). 
T T 


Moreover the norm of the operator is equivalent to J, the best constant in the 
inequalities above. 


The analysis of the individual two weight inequality for positive operators is 
much simpler, as is the case of dyadic operators. For certain non-positive dyadic 
operators, see the result of Nazarov-Treil-Volberg [39], and the much more recent 
works of Vuorinen [57, 58]. These results have found significant interest, due to the 
Haar shift operators of Petermichl [43], the remarkable median inequality of Lerner 
[28] and its extension in [16], and the Hyt6nen representation theorem [13]. 

The Hilbert transform is the first non-positive continuous operator for which 
the individual two weight problem has been solved. And, one would only ever 
expect that the solution would be of interest (or even possible) for a few canonical 
choices of operators, such as Hilbert, Cauchy and Riesz transforms. Foundational to 
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the solution for the Hilbert transform is the monotonicity of the kernel. No other 
canonical choice will satisfy such a simple condition. For a special case of the 
Cauchy transform [24] one can make progress. But the case of Riesz transforms 
is much harder [26, 54]. 

The individual two weight question makes sense for any | < p < on, and there 
are characterizations in this, and other off-diagonal cases for positive operators. For 
dyadic analogs of singular integrals Vuorinen [57] has shown that these inequalities 
can be characterized by quadratic testing conditions. Also see [27]. The extension 
of this characterization to the setting of the Hilbert transform is challenging. 


The Hilbert Transform 


The two weight inequality for the Hilbert transform was addressed as early as 
1976 by Muckenhoupt and Wheeden [33].'But, it received much wider recognition 
as an important problem with the 1988 work of Sarason [50]. The latter was 
part of important sequence of investigations that identified de Branges spaces as 
an essential tool in operator theory. His question concerning the composition of 
Toeplitz operators, see section “Sarason’s Question on Toeplitz Operators”, was 
raised therein, and advertised again in [51]. This question related the individual 
two weight problem for the Hilbert transform to a profound question from operator 
theory. 

While not stated in the language of the Hilbert transform, Sarason wrote that it 
was ‘tempting’ to conjecture that the full Poisson Az condition would be sufficient 
for the two weight inequality. In an important development, F. Nazarov [34] showed 
that this was not the case. The two weight problem was seen to be important 
to Model spaces, namely certain embedding questions for Model spaces can be 
realized as a two weight inequality for the Hilbert transform. In particular, a more 
delicate counterexample was developed by Nazarov-Volberg [40] to disprove a 
conjectured characterization of the Carleson measures for a model space. The 
Nazarov counterexample was also used by Nikol’skii-Treil [42], in the context of 
spectral theory. 

The Nazarov counterexample is by way of a Bellman function approach. In 
section “Example Weights”, we give an explicit example. It is worth noting that 
in Sarason’s question, the weights have a density |f|?, for analytic f, and the 
subharmonicity could be an important part of the problem. But, in the context of 
model spaces, completely singular arbitrary measures can arise. In section “Example 
Weights”, one of the weights is uniform measure on a Cantor set. 


'In particular, they noted that the simple A2 condition was not sufficient for the boundedness of 
the Hilbert transform, and conjectured that half-Poisson Az conditions would be sufficient, an 
indication of the powerful sway held by the Muckenhoupt A> condition in the early years of the 
weighted theory. 
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Nazarov-Treil-Volberg were creating the field of non-homogeneous Harmonic 
Analysis, in a series of ground-breaking papers [35-37]. Their work, and a 
revitalization of the perspective of Eric Sawyer from the 1980’s, lead them to 
conjecture the characterization proved in this paper. Moreover, their influential proof 
strategy, devised in [38, 56], lead to a verification of the conjecture in the case that 
both weights were doubling. This paper uses their strategy, with several additional 
features. At the same time, their approach is generic, in that it applies to general 
Calderén-Zygmund operators. Specific properties of the Hilbert transform had to be 
used in the characterization. These properties were identified in [17, 20, 21, 23], and 
the more precise description of what was accomplished at each stage is spread out 
throughout the paper. 


The Circle 


The two weight inequality has an equivalent formulation on the circle, which we 
formulate now. Given two weights w,o on the circle group T = R/27Z, we 
consider the norm inequality 


_ 2 
[| [ t00-00 (—) oly) dw < MU Pace): (7) 


This is abbreviated to ||H?f||r2() < NIlf llz2@)- 


Theorem 1.3. The inequality (7) holds if and only if the pair of weights below satisfy 
the conditions below and their duals. For all intervals I C T, with |I| < 1, there are 
finite constants Ay and J, such that 


is 
- »P™(wlpy) (7, 1 = |I]) < A, (8) 


i IH™1)/2 dw < To(N. (9) 
vi 


Moreover, letting Az and J be the best constants in these inequalities and their 
duals, there holds N ~ Aut aes 


In (8), the term P™ w(x, r) is the standard Poisson operator on the disk, evaluated 
at a point in the unit disk given by the center of the of the interval x;, and the radial 
factor r. 

Let us indicate how to prove the theorem above from Theorem 1.1. Fix o and w 
be two weights on T. Embed the weight w into [0, 1] in the natural way, and call 
the resulting measure w’. Place three copies of o on the intervals [—1, 0], (0, 1] and 
(1, 2], and call the resulting measure o’. Thus, o’ and w’ are two weights on R. It is 
clear that o’ and w’ satisfy the Poisson Az condition with holes on R. 
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For a function f € L?(T;0), let f’ be three copies of f on the intervals [—1, 0], 
(0, 1] and (1, 2]. Viewing T as [0, 1], there is a subtle difference between H?f (x) and 
Hf’ (x), the former computed on T, and the latter on R. Namely 


3 
a — Lyf) < / WO) =P ofa. 


It is easy to see that the A condition implies that the operator on the right is 
bounded. Hence, the testing conditions on T imply those for w’ and o’. Hence Hz, 
maps L”(o’) to L7(w’). From that, we deduce the boundedness of H?. 


Cora Sadosky. Cora Sadosky and I met only a couple of times, which is a 
pity, since my research has been so strongly influenced by her passions and 
interests. Her work with Cotlar on the L’ variant of the Helson-Szegé theorem is a 
beautiful complex variable result well beyond the reach of the current real-variable 
techniques. Her interest in Hankel forms on two and more complex variables has 
been my own for several years. And, in a number of small ways, I work to support 
more diversity in the profession, again following her lead. 

Cora Sadosky’s family came up in 2005, during a three month stay in Argentina, 
in a antiquarian bookstore just a few steps from the Casa Rosada in Buenos Aires. 
The proprietor, upon hearing I was a mathematician, remembered his own youth and 
a compelling Professor Manuel Sadosky. He remembered that the Professor had a 
daughter and asked after her. This was the third or fourth conversation of this type 
I had in that lovely city! It is a privilege to work on the beautiful subject of mathe- 
matics. Even more so to have passion, and insights that others will carry forward. 


Preliminaries 


Principal Values 


We make no assertion about principal values of the Hilbert transform, and do not 
expect them to exist in the generality in which we are considering. One can then be 
concerned about how the definition is made. There are a couple of different options. 
One can impose some sort of truncation on the integrals, and the statements of 
the theorems are then understood to be uniform over all truncations. Many of the 
different possible truncations will be equivalent, since the Az condition will hold, 
see [31] for a general discussion of this issue. Alternatively, one can formally define 


dyd 
Hofraw= ff tore 


for all f, g which have closed supports that are a positive distance apart, and extend 
H linearly from there. 
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In our proof, all of the essential difficulties in the proof arise when f and g have 
widely separated supports. The definition of H, in this case is of course by the 
formula above. 


Dyadic Grids and Haar Functions 


A grid is a collection D of left closed, right open intervals so that for all J, J € D, 
INJ = 9,1, J. Further say that D is a dyadic grid if for all integers n, the collection 
{le D : |I| = 2"} partitions R, aside from the endpoints of the intervals. 

For a sub collection ¥ of a dyadic grid D, set 2/ to be the minimal element of 
F that contains J; J need not be a member of F’.. Set z ai to be the minimal member 
of F that strictly contains J, inductively define qed = rel ). 

Say that the collection D is admissible for weight o if o does not have a point 
mass at any endpoint of an interval J € D. 


Haar Functions 


Let D be admissible for o be a weight on R. If 7 € D is such that o assigns non-zero 
weight to both children of J, the associated Haar function is chosen to have a non- 
negative inner product with the independent variable, (x, h7 (x))« = 0, a convenient 
choice due to the central role of the energy inequality, (19). 


oy. |T)o)(H@) LG) 
hf (x) = |2eote (2 =), (10) 


In this definition, we are identifying an interval with its indicator function, and 
we will do so throughout the remainder of the paper. This is an L(o)-normalized 
function, and has o-integral zero. If o is supported only on one child of J, then we 
set hf = 0. 

For any dyadic interval J) with o(/)) > 0, the non-zero functions among 
{0 (Ip) 7g} U LAE : 1 € D,I C Ip} form an orthonormal basis for L? (Ip, 7). We will 
use the notation ie (Ip, 0) for the subspace of L* (Ip, 0) of functions with mean zero. 
It has orthonormal basis consisting of the non-zero functions in {hf : J € DI C Io}. 
These are familiar properties. But, another familiar property, that the positive and 
negative values of hf are comparable in absolute value, fails in a dramatic fashion 
for non-doubling measures. See Fig. 4. 

We will use the notations E?f = o(1)"! ff do, f(l) = (f, hf), as well as the 
equality below, holding for those J with hf # 0. 


ACh = (fshP)oh? = 1,E?, f + 1Ef_f — IEVS. (11) 
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Fig. 4 Two Haar functions. For the /eft function, the weight is nearly equally distributed between 
the two halves of the interval, in sharp contrast to the function on the right, in which the weight on 
the right half is much larger than on the left 


This is the familiar martingale difference equality, and so we will refer to Af 
as a martingale difference. It implies the familiar telescoping identity EVf = 


Dorr ay AG. 
The Haar support of a function f € L?(c) is the collection {J : f(I) ¥ 0}. 


Random Dyadic Grids 


Let D be the standard dyadic grid in R, thus all intervals [0, 2”] for n € N are in D. 
A random dyadic grid D is specified by @ = {@,} € {0, 1}, and the elements are 


f= + > 2 On, T€ED, 
ni2*e<]| 


The natural uniform probability measure P is placed upon {0, 1}7. 

Fix 0 < ¢ < landr €N. An interval J € D is said to (¢,r)-good if for all 
intervals J € D with |J| > 2’~'J|, the distance from 0J and either child of I is at 
least |J|*|J|!~©. Otherwise / is said to be (e, r)-bad. These are the basic properties of 
this definition. 


Proposition 2.1 These three properties hold. 


(1) The property of I = i+@ being (€, r)-good only depends upon w and |I\. 
(2) Peood = PU is (€, r)-good) is independent of I. 
(3) Pbad *= 1- Pood Ss e2-". 


Proof An interval I = i+a is equally likely to be the left or right half of its parent 
Fant , depending only on w,, where |J| = 2”. Similarly, J is equally likely to be 
any one of the 2’ potential positions in 74,/, and its exact position is determined by 
{On,- ++,» @n+;—-1}. This proves the first two claims. 

For the last, if J is bad, then for some t > r, there holds dist (/, Onp1) < 
2¢—®)"|7|, For this to happen, it is necessary that the numbers {@, : n+ [(1—«)t] < 
u <n-+t-— 1} all be equal, and hence are either all 0 or all 1. This clearly proves 
that 
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foe) 
Pbaa < > 2! (t—[(—e)¢]) <e 0) er 
t=r+1 


oO 


This elementary proposition is used in the following fundamental way. Fix two 
weights w,o. With probability one, a random D is admissible for both w and o. 
Indeed, the collection of points that are point masses for one of the two weights is 
a fixed countable collection of points. And any fixed point has probability zero of 
being an endpoint of an interval in D. Hence, we can, with probability one, define 
the Haar basis adapted to these two weights. Write the identity operator on L?(o) by 


Pooodl + Phaaf Where Paoogf = GAP ohf. 


IED : Tis (€,r)-good 


Use the same notation for the weight w. 


Proposition 2.2 There holds 


7 2 = 
Phat lle Se" 2 Fs 


Proof The location of J and the property of J being bad are independent, hence 


Pena lle = ED) Visa fC)” = Pras FO? = Praallf lle 


IED IED 


and then the proposition follows. oO 


Lemma 2.3 For any constant 1 < C < oo, 0 < € <1, there is a choice of re N 
sufficiently large so that this holds. Let w,o be a pair of weights for which the 
constant H and the constant N in (1) are finite. Suppose there holds uniformly over 
admissible dyadic grids D, 


(Ho Peoodl > Peooa8)wl S CHF llollgllw. (12) 


then, N < 2CH. 


Proof Use Proposition 2.2 on the good and bad projections, as written and the same 
version for L7(w). 


(Hof: 8) wl < E{| (Ho P oooat : Peoa8) wl + | (Ha P Socal: Phaas) wl 
+ |(HoPoaals Pesca) wl + |(HoP Seals Pass) wl}- 


The first term is controlled by the assumption (12), and the remaining terms are 
controlled by the finiteness of N and average-norm estimate on the bad projection. 
By appropriate selection of f € L*(c) and g € L?(w), there holds 
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Ni, = CH +Ce712-"PN,,. 


For any fixed ¢, we can take r > e~! log e~!, so that the second term can be absorbed 
into the left hand side. Oo 


Context and Discussion 


The random grid method was pioneered in [36], and is a critical tool in non- 
homogeneous analysis [56], where the weights need not be doubling. It has 
a broader set of uses, as witnessed by a powerful representation of a general 
Calderén-Zygmund operator as a rapidly convergent sum of dyadic operators due 
to Hytonen [13]. 

The parameterization of the grids used here follows Hyt6nen [12], but the 
statistics of this parameterization are those of the random shift in Nazarov-Treil- 
Volberg [35, 36]. 


Necessary Conditions 


Herein, we take up the necessity of the Az condition from the norm inequality. 
Following that is the monotonicity property, an essential property of the Hilbert 
transform, and then showing the necessity of the energy inequality from the Az and 
interval testing condition. The energy inequality is foundational to the proof. 


The Az Condition 


The A» condition has different forms, and so we clarify the language associated with 
the Az condition here. The simple Az condition is 


a ee 


the supremum formed over all intervals J. This reduces to the classical Muckenhoupt 
condition if w(dx) = w(x)dx, where w(x) > 0 ae., and o(dx) = w(x)~!dx. Next, 
are the half-Poisson conditions: 

wi) 


sup P(o, 1) —— < oo 
I [Z| 
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Finally there is the full Poisson Az condition 


sup P(o, 1) - P(w, 1) < co (13) 
I 


and of course, we only use the Poisson condition with holes, of Hytonen [15]. 
We verify that the Poisson Az condition (3) is necessary for the two weight 
inequality (1). 


Proposition 3.1 Assume that the pair of weights do not share a common point 
mass, and that the norm inequality (1) holds. Then, the Az condition (3) holds. 


Proof Fix the interval J = (a, b) as in (3), and let a € J. We will estimate half of 
the Poisson integral of w using the notation 


[Z| 


(i+ disete. eh ” 


pr(x)? = 
so that P(w- [b, 00), 1) = ||p;- [b, 00) ||? Below, we estimate the right half of the 


L2(w)" 
Poisson integral of w. 


o(I) ise 1 1 
[71/2 EW psa II \7| + dist(x, J) y-x wiatay) 


= (Ho), Pr)w S No (1)! |Iprllw. 


Rearranging, 
I! 
a - P(w- (a, 00), 1) S N2. (15) 
Clearly, the same inequality holds for (—oo, a]. oO 


The Monotonicity Principle 


Certain kinds of off-diagonal estimates for the Hilbert transform have concrete 
estimates in terms of the Poisson integral. This estimate makes this precise, and 
shows moreover that we need not be that careful about exactly which function 
appears in the Poisson integral. It is at the core of the entire proof. 


Lemma 3.2 (Monotonicity Principle) Suppose that the two weights o and w 
satisfy the Ay bound, and neither has a point mass at an endpoint of I. Let J C I. 
There holds for any g € L?(J, w), with w-integral zero, 


P(o(R-D.D(—.3) S (H(R-1).B)w- (16) 


Fite 
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Fig. 5 An illustration of the slope = i" Pig, J) Hu 
monotonicity principle 


Here, § = > p|g(J’)|h',, is a Haar multiplier applied to g. Suppose that J C 1 
is good, with 2"|J| < |I|. Then for any two compactly supported weights |v| < | 
supported off of the interval I, there holds 


x 


(Hv, g)wl S (HM, B)w = PUL ING sh (17) 


Note that in the first estimate, the Poisson term is always estimated above by 
an inner product involving the Hilbert transform. In the second, note that the inner 
product can always be made larger by making the weight positive. Moreover, under 
moderate assumptions on the support of the weight, the first inequality can be 
reversed. See Fig. 5. In that figure, the function ju is outside of 2""—°J, so that Hy is 
a smooth increasing function on J. Moreover, the derivative of Hz is approximately 
|J|~!P(w, J). So, if we form an inner product with the Haar function h'’, we only 
need to be concerned with the linear approximation to Hj. However, the conditions 
to get the reversal are particular, and this drives the case analysis in different sections 
of the proof. 


Proof We consider the first estimate. By linearity, it suffices to consider the case 
of g(x) = hAy(x), for J C J, and indeed we can take J = J. We need to separate 
the two weights involved. The A, condition is the only condition needed for the 
weak-boundedness principle, Proposition 7.4. Applying it in this setting, notice that 
it shows that for A > 1, 


(Ho (AL — 1), hi)| S Ay? VoQI—D. 


The assumption that o does not have mass at the endpoints of J implies that 0 (AJ—1) 
can be made arbitrarily small, as A | 1. Therefore, it suffices to consider H, (R—AJ), 
for some fixed A > 1. 
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Then estimate 


1 
(H°(o - (R— AD), bY) = [ a / Sa yhi @) wldsyo (dy) 


l 1 Ww 
7 (ee _ = (x) w(dx) o(dy) 


= [ ; / en (x) w(dx) 0 (dy) 


[| X— Xj 
2 F w 
7 La ji-aione 1 


= P(o -(R—-AN), Hi AT), 


Ml 


Here, x, is the center of J, and it can be inserted for the usual reason that hy has 
w-integral zero. Then, use the fact that (w—.x,)hY > 0, and that (y—x)(y—x,) > 0. 
So (16) holds. 

The second inequality (17) comes with the assumption that J C J, 2"|J| < |J|, 
whence dist(J,/) > |J|‘\Z|'~* => 2"¢-©|J|. Namely, the support of h¥’ and that of 
4 are separated. Then, inserting a constant as we can since the Haar function has 
integral zero, 


: 1 1 F 
(Hv, hy = / ; 4 (i vd) wlan 


4 x] Ww 
= | reer es ad 


Notice that the integrand is non-negative, hence we can make the integral bigger 
in absolute value by replacing |v| by jz. This is the first inequality in (17). For the 
second equivalence, by the separation in supports, we have 
the range of integration. And this finishes the proof. 


(yx) (y—x7) = (y—xy)? m 


The Energy Inequality 


The energy inequality is phrased in terms of the quantity 


WwW aw [2 = Ww 
E(w, D? := [| PEP |x- 2 —Epx|" = 0? D> x hy). (18) 


J: JCI 


Lemma 3.3 (The Energy Inequality) For any interval Ip and any partition P of 
Ip into intervals such that neither o nor w have point masses at the endpoints, there 
holds 
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2Poth \ 1). DP Ew. IP w() = CoH? (lo). (19) 


Here, Co is an absolute constant. 


Proof It follows from (16), viewed in dual fashion, that 


P(o(lo \D, DPE, D?wD) S WH (o Uo \ D) “Thi, 
S Ao - Io) <M, + WH -D) TN, 


SIH +h) - I, + Po. 


Above, we have appealed to the testing assumption (4). Summing over J € P, the 
second term above is clearly no more than J*o (Jy). And the second term is no more 
than 

|JA(o + Io) - Jolly, < T° (Io). 


w— 


Context and Discussion 


In the absence of common point masses, the necessity of the full Az condition, 
namely (13), was easily available, with an argument of Sergei Treil already pointed 
out by Sarason in his note [51]. This argument, based upon complex variables, 
has close analogs in [38, 56]. A real variable proof is in [20], it is essentially an 
elaboration of the argument in the early paper of Muckenhoupt and Wheeden [33]. 
Despite the necessity, only the half Poisson A, condition is used, together with 
testing, in the proof of sufficiency, in the case of no common point masses. 

Higher dimensional extensions of the A, which are not straight forward, are 
discussed in [19]. There are notable distinctions important to higher dimensions. 
First, the necessary Poisson type condition only comes in its ‘half’? form. Second, 
the power on the Poisson kernel comes as the square of the dimension of the kernel 
involved, a feature familiar from the analysis of reproducing kernel spaces. Third, 
the degree of the Poisson kernel matches the important derivative Poisson decay, 
important to energy considerations, only when the dimension of the kernel is one. 

The energy inequality was influenced by the following assumption placed upon 
the pair of weights in [38, 56]. Assume that there is a finite constant so that for all 
intervals Jp, and all partitions P of J into intervals, 


» P(o + Ip, 1)? w() < P20 (I). (20) 
IeP 
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Also assume that the dual inequality holds. In the language of Nazarov-Treil- 
Volberg, this is the pivotal condition. They proved 


Theorem C Assume that w and o do not share a common point mass. Then, there 
holds N < Ay? + TH+ P. 


This is a very strong Theorem, with an important proof. It decisively used the 
tools of non-homogeneous harmonic analysis, namely random grids, and good- 
bad projections. The pivotal condition controlled certain degeneracies in the pair 
of weights, compare to Definition 4.3. To illustrate the difficulties in the general 
case, we prove this theorem in section “Proof Under the Pivotal Assumption”. 

The pivotal condition holds if the pair of maximal function estimates hold, 
namely M, : L*(o) + L?(w) and My, : L?(w) + L’(o). This is easy to see. 
From (20), 


DEP fo, Pw) < DF inf M(o - Io)’ w() 


IeP IeP 


< | M@-h) dw < o(h), 
Io 


by the assumed norm bound on the maximal function. One sees that Theorem C 
offered a complete characterization of the two weight inequality for the triple of 
operators (H,,M,, M,,). If the pair of weights are doubling, then the boundedness 
of the maximal functions is a consequence of the Az condition.” The full character- 
ization of the boundedness of the Hilbert transform was thus known for doubling 
measures. See [56]. 

The pivotal condition is generic in the following sense. Assuming the pivotal 
condition, the Hilbert transform can be replaced by a generic Calder6n-Zygmund 
operator with one derivative on its kernel. This, and its extension to operators with 
a rougher kernel, was fundamental in the paper [45], whose main result was an 
important intermediate one in the solution of the Az conjecture [13]. 

Nazarov-Treil-Volberg, in language reminiscent of Sarason, wrote that “perhaps 
the pivotal condition is necessary’ for the boundedness of the Hilbert transform. This 
turned out to have a strong measure of truth, in that using the specific structure of the 
Hilbert transform, the energy inequality was shown necessary in [20]. Note that one 
can formally obtain the pivotal condition (20) from the energy inequality (19) by 
raising the energy term E(w, J) to the zero power, rather than the necessary power 2. 
The paper [20] then adapted the approach of [38, 56], essentially imposing a new 
weaker condition on the pair of weights in which one raised the energy to a power 
intermediate between 0 and 2. In addition, that paper provided an explicit example, 


? Alternatively, under the assumption of w being doubling, check that the energy satisfies E(w, 1) = 
1, with the implied constant depending upon the doubling constant. Thus, the necessary energy 
inequality implies the pivotal condition. 
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ie Lb ia 


Fig. 6 The function wie are graphed for three separate intervals 


recounted in section “Example Weights”, that showed that the pivotal condition (20) 
is not necessary for the boundedness of the Hilbert transform. 

The energy inequality is rather subtle. The Poisson term P(o,/) can be much 
larger than the simple average, but this is compensated for with the terms 
E(w, 1)?w(). The Fig.6 is offered to provide some insight into the ‘long tails’ 
that the Poisson term can have. 

Another indication of this subtlety is the observation that the energy inequality 
will not follow from just the A condition. Given interval Jp, and partition P of Ip, 
one can write 


a P(o, 1° E(w, D?w() < Ad you -P(o, 1)? 


leP IeP 
ar 
=A [x o(dx). 
Sy SS (Ul + dist, DY 


To finish, one would have to know that the function inside the integral is bounded. 
But, this is not true in general. Though a very tame BMO function, this fact does 
not help, since o is a general measure, and need not satisfy any Ago type condition. 
Indeed, the proof of the main theorem would be more or less classical if the weights 
satisfy a Ago type conditions. 

The monotonicity principle, Lemma 3.2, was noted in [21]. It, with the energy 
inequality, are essential aspects of the proof. 


Global to Local Reduction 


Our aim is to prove the estimate (12), 


sup| (Ho Peooal > Powel 3 S Hllfllo lgllw- 
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That is, the bilinear form only needs to be controlled for (¢, r)-good functions f = 
Prooal and similarly for g, goodness being defined with respect to a fixed dyadic 
grid. Suppressing the notation, we write ‘good’ for ‘(e, r)-good,’ and it is always 
assumed that the dyadic grid D is fixed, and only good intervals are in the Haar 
support of f and g. We clearly remark on goodness when the property is used; any 
value of O<é< ‘ is sufficient for our purposes. The symbol ¢ is kept throughout, 
as a guide to the appearance of the good property of intervals. 

The inequality above is reduced to the local estimate, (32), at the end of this 
section. It is sufficient to assume that f and g are supported on an interval J°; by 
trivial use of the interval testing condition, we can further assume that f and g are of 
integral zero in their respective spaces. Thus, f is in the linear span of (good) Haar 
functions hf for I C I °, and similarly for g. 

The distinction between J C J andJ € I (J C I and 2’|J| < |J|) forces some 
case analysis. This is further simplified by this assumption on the Haar supports of 
f.g. There are two integers sy, Sz such that 


f= DD a (21) 


1:1cI° 
logs ||Es¢+rZ 


and similarly for g. Thus, the lengths of the (good) intervals J are restricted to an 
equivalence class mod r, which is to say that the scales of f are separated by r, 
and the same for g. This will be a convenience at a few technical points below. Set 
Dy = {I : logy|I| € sp — 1 + rZ}, so these are the children of the intervals that 
appear in (21). Due to the probabilistic way in which the grids are constructed, we 
can further assume that /° € Dy. Also set Dy := {I : logy|I| € sg + rZ. 

We are to control the bilinear form 


(Hefialw = >”. GAT Aree. (22) 
TJ: 1,JCI 


The sum is broken into many summands, as is typical in these arguments, but the 
manner in which it is done has some important points below. The most important of 
these are the two ‘triangular’ forms 


Bove ce g) = > > inf AY. : (Aol, AY 8) w (23) 


T:1CP J: JGly 


and the dual form, B>“(f, g). Here, J € I means that J C J and 2’\J| < {J|, 
in words ‘J is strongly contained in 7’. And the interval J; is the child of J that 
contains J. Goodness of J justifies the use of this condition. A basic fact, proved in 
section “Elementary Estimates”, is 


Lemma 4.1 There holds 


| (Hof, 8)w — BY" (f, g) — BP" (f, 8)| S Hlfllollgllw- 
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Thus, the main technical result is as below; it immediately supplies our main 
theorem. 


Theorem 4.2 There holds 


|panere oF 2g)| Ss Hilf lle llgllw- 


The same inequality holds for the dual form B’*'" (f, g). 


The remainder of this section is devoted to a reduction of the global Theorem 4.2 
to a local estimate described in section “The Stopping Form”. In the local estimate, 
the function f is more structured in that it has bounded averages on a fixed interval, 
and the pair of functions f, g are more structured in that their Haar supports avoid 
intervals that strongly violate the energy inequality. Still the argument to control this 
term requires a subtle recursion. 

We construct stopping data, which accomplishes two ends, in that it will control 
certain telescoping sums of martingale differences of f, and that it controls certain 
degeneracies in an energy estimate on the weights. 


Definition 4.3 Define F, the stopping intervals, recursively by initializing /° € F, 
and in the recursive step, if F € F is minimal, add to F the maximal subintervals 
F' C F, with F’ € Dy, either 


f stopping E%,|f| > Cay(F) = EZ|f\. 
Energy Stopping ||H, (F \ F’)- F'||?, > CHCo(F’). 


That is, we stop if either the average of f becomes too large, or, essentially, the 
energy condition becomes too large. 


For appropriate constant C, it follows that F is o-Carleson, namely 


Yo off) < Gol), =F EF. (24) 
F/€F : F’CF 


Many properties of the o-Carleson property are used below. But, also note the 
following property: 


IE7f| < Cop(-l) (25) 
We will use the notation 
fe > AR Fes. (26) 
I€D :a¢l=F 


and a dual projection Q/'g, is defined similarly, but importantly, we replace 7-J = 
F by 17J = F, meaning that F is the smallest interval in ¥ such that J € F. (Note 


that both are projections, but Pzf is a structured function, while Q¥g is not.) The 
o-Carleson property allows us to estimate 
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Yo fap(F)o(F)"? + [PEF lle} OPgllw 


FeF 


1/2 
< Yo fay(FYo(F) + P22} x So ior S Wfllollgll- 

FeF FeF 
(27) 


We will refer to as the guasi-orthogonality argument. It holds only under the 
assumption that the projections Q¥ are pairwise orthogonal. It is very useful. 

We henceforth concentrate on the ‘above’ forms, with all considerations applying 
in their dual formulation to control the ‘below’ forms. Return to the double sum (22), 
and define 


Bel. 8) = | BP" (Pas, Org). 
Fer 


The global to local reduction is: 


Corollary 4.4 (Global to Local Reduction) There holds 


sal ar g)- Bee, 8)| S Hllfllellgllw- 


Proof Observe that B*>°’*(f, g) is a sum over pairs of intervals (J, J) with J € 1, 


whence z7J C aI. Now, the case of z7J = zl is contained in the form 
bE Ns (f, g), hence we need only concern ourselves with the case of wzJ © ari, 


that is, we need only bound 


> > BP (Pe, OF 2). 
FEF FleF 
F'CF 


Set gr ‘= Olfg. The sum in question is 


>> DS BLAIS: (Hole, 8r)w- (29) 


FEF 1: IDF 


We invoke, for the first time, the Hilbert-Poisson exchange argument: (a) Replace 
the argument of the Hilbert transform by a stopping interval. (b) Invoke the stopping 
tree construction to control the sum of martingale differences of f. (c) Apply interval 
testing, on the stopping interval. (d) Use the monotonicity principle to dominate the 
complementary term in terms of a Poisson integral. (e) Analyze the Poisson term. 
(f) Use quasi-orthogonality, as needed. 
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The argument of the Hilbert transform is J, the child of J that contains F’. Write 
Ip = F + Up — F), and use linearity of H,. Note that by the standard martingale 
difference identity and the construction of stopping data, 


TO cog 
| y Tp AY 


I: IDF 


<a/(F), FeF. 


Hence, invoking interval testing, 


| >» a ed (HoF, gr)w = > ay (F)| (HoF, gr) w| 


FeF I: IDF FeF 


SHY a (oF) |Igrlhw- 
FeF 


Quasi-orthogonality bounds this last expression. 

For the second expression, when the argument of the Hilbert transform is Ir — F, 
is the objective of section “The Remaining Part of the Global Estimate”. We have 
proved 


S Hlfllollgllw- (30) 


ps De or Art (Holr, &F)w 


FEF I: IDF 


This completes the Hilbert-Poisson exchange argument. 


| ELAR -Ur-FH]Se= Dah), FEF. 


I:1DF FleF 


Therefore, the monotonicity property (17) applies, and yields 


| So ERA Holle—F). gr] So Polo S Wel «Fe. 
1:1DF JEI*(F) 
(31) 
Here $7 = ye T(F):JE rlg(J)| - 2, so that every term has a positive inner product 
with x, and 7*(F) are the maximal good intervals J € F, and J € Dy. (If J ¢ Dg, 
then (g, hy) = 0, by choice of g at the beginning of the proof.) 
The control of the sum over F € F of (31) oO 


It remains to control Pepsi tg ,g). Keeping the quasi-orthogonality argument in 


mind, appropriate control on the individual summands is enough to control it. To 
describe what has been done, one must note that the functions P&f need not be 
bounded. But, we are only concerned with averages over intervals where the average 
will be bounded. In addition this function and Og are well-adapted to the pair 
of weights w,o. The next lemma, combined with the quasi-orthogonality estimate 
clearly completes the proof of the Theorem. 
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Lemma 4.5 (The Local Estimate) For each F € f, there holds 


[Be"" (Pet, Org)| < Hlay(F)o(F)'? + ||Pef loll Orsllw- (32) 


The first step in the proof of the Lemma above is to invoke the Hilbert-Poisson 
exchange argument again, but we will arrive at a Poisson term which falls outside the 
immediate scope of the energy inequality. Focusing on the argument of the Hilbert 
transform in (32), we write [; = F — (F —I,;). When the interval is F, and J is in the 
Haar support of OF g, notice that the scalar 


ap(Fey:= SD) EZAY 
I: JGICF 


is bounded by an absolute constant, by construction of the stopping intervals. 
Indeed, by the telescoping identity for martingale differences, 


a(Fles= >) ELAf=EZf, 


T:I~GICF 


which is at most Ca(F), since zJ = F. Therefore, we can write 


| oY BT Are HF. Ays)| sa/(P)|(HoF, Yo esA¥s), (33) 
T:ICFIJ:JEl J: JGF 
< Ta(Fo(F)'?| > eye < To)" sll. 
J:JGF 


(34) 


This uses only interval testing and orthogonality of the martingale differences, and 
it matches the first half of the right hand side of (32). 

When the argument of the Hilbert transform is F — 7, this is the stopping form, 
the last component of the local part of the problem. It requires a subtle recursion, 
described in section “The Stopping Form”. 


Context and Discussion 


Many 71 theorems have arguments, sometimes subtle ones, about telescoping 
sums which collapse. These arguments are systematically handled herein with the 
stopping data, as opposed to more intricate Carleson measure arguments. 

The use of the energy stopping intervals is motivated by the use of the 
corresponding intervals, under the pivotal condition (20), in [38, 56]. However, the 
pivotal condition is not necessary for the two weight inequality, while the energy 
inequality is necessary from the A» and interval testing conditions. 
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Initial arguments had largely ignored the structure of the pair of functions f, g 
in the inner product (H,f, g),,, instead concentrating on proving an intricate series 
of Carleson measure type estimates. This changed with the argument of [21], which 
introduced Calder6én-Zygmund stopping intervals, and the quasi-orthogonality argu- 
ment into the subject. It was only then that the role of the global to local step was 
identified, but not proved. Stopping data also allows us to avoid the subtle problem 
of absence of canonical paraproducts. Attempts to introduce them induce ad hoc 
elements into the proof. 

This section begins with the elementary and familiar Lemma 4.1, and then argues 
that the control of the triangular form B*”°Y*(f, g) splits into the ‘global to local’ and 
the ‘local’ part. The authors of [23] only had the first reduction. And, using the 
techniques of that paper, could prove 


Theorem D ([23]) There holds |B*°’*(f, g)| < {H + BoohMlfllellelly 


~N 


where H = Ay * + T, and the remaining constant is the best constant in 


IB"(F, 8) S Boo (Io)"/?Ilelhw. 


where |f| < 1,,, and Ip is any interval. The corresponding estimate holds for the 
dual from B’°'” (fg). 


This is a powerful Theorem, strongly suggesting that the Az condition and testing 
the Hilbert transform over bounded functions is sufficient for the L? boundedness of 
H,. But, there is no obvious way to deduce such a result from the Theorem above. 
Phrasing things differently, it can be very difficult to translate partial information 
about the triangular form B*°’*(f, ¢) to information about (Hf, g), a potentially 
serious obstacle if a richer theory of two weight inequalities for singular integrals is 
to be developed. 

The parallel corona was introduced in [22] to surmount this obstacle. With it, the 
result that could be proved the first real variable characterization of the two weight 
inequality for any continuous singular integral. 


Theorem E (Lacey Sawyer Shen Uriarte-Tuero [22]) There holds N ~ Aj! A oe 
Too, where the latter constant is the best constant in the inequalities below, uniform 
over all intervals I, and Borel subsets E C I. 


[iteter dw < T2,a(D), [eter do < T2,w(I). 
T I 


(One tests the Hilbert transform on 1g, but only the weight of the interval I appears 
on the right.) 


The parallel corona delays the application of Lemma 4.1, this feature combined 
with a special function theory specific to Haar expansions for non-doubling 
measures, were the critical ingredients. 
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The parallel corona has been used to give short transparent proofs of two weight 
inequalities for singular integrals. See the last page of Hyt6nen’s survey [14] and 
the article of Tanaka [55]. 

It is natural to wonder if there are any L’ analogs of the main Theorem. We have 
some clues as to how this might work, in the more complicated testing conditions 
of Vuorinen [57, 58]. One could see that the global to local reduction would work 
under variants of these more complicated testing conditions. The control of the local 
term is however a heavily Hilbertian argument, and so potentially very difficult to 
extend to an L’-setting. 


The Remaining Part of the Global Estimate 


The last part of the global-to-local part of the arugment is this Lemma. 


Lemma 5.1 Using the notation of section “Global to Local Reduction”, there holds 


| 2 YD BAR: Holle \ F). aru] S Kllflle ls. (35) 


FeF I: IDF 


Our method of proof has these elements. (a) Use monotonicity to pass to a 
positive operator. (b) Identify the inequality needed as an instance of a two weight 
inequality, but not for general functions, only one fixed function, and a derived 
weight 44 = [o,y,f that is well-adapted to the function; (c) Invoke the parallel corona 
method to prove the desired two weight inequality. Along the way, we will identify 
simplifications of the general case of a two weight inequality for a positive operator. 

Begin the proof by observing that 


| 0 EAT GF SO= alr) F, FEF. 
1:1DF FGF 


where ee = F’ \ F”, with F” being the F-child of F’ that contains F. Also, by 
monotonicity, the left-side of (35) is at most 


P( YS w()- Fy. J") > (gay HF), Hes: 


FEF J*€IJ* (F) FoeF J:JICs* 
F'DF aFJ=J* 


The desired estimate is a consequence of new L?-estimate for the modified 
Poisson operator 


Pen =f oP wy (36) 
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which is extended to Pf(I) = Pf(x;, |I|). The relevant measure on the upper half- 
plane is given by 


B= >> DO Syl DO OAM. (37) 


FEF J*€IJ*(F) J:JICs* 
utrJ=J* 


Finally, the estimate we need is as below, in which we have eliminated the sum of 
JEeF*. 


| > YF’) SS Pp) Or} S Hllflle- (38) 


JET*(F) 


FleF pb 
FeChs(F’) 
Here, Chz(F’) is the collection of F-children of F’, and Q; = J x [0,|K]] is the 
Carleson box over interval J. 

This last inequality is in fact universal, in that we could fix the measuer jz, replace 
f by an arbitrary function, and the inequality is still true. But this fact is not needed. 
And, we can use the fact that f and the measure jp are related through the stopping 
data, to simplify the proof of (38). 

Our knowledge of two weight estimates suggest that the inequality (38) is easiest 
to prove by duality, and using the joint stopping data on f and the dual function 
yeV (Ri. 4), a technique refered to as the parallel corona. We will reduce the 
inequality (38) to two testing inequalities. One will be a reformulation of the energy 
inequality and the other will be a consequence of the Az condition. 

By duality, the inequality we establish is 


~ = 
Yok’) SD PF.) i y de S Hflelly le (39) 
FICF JET * (F) Qs 
FECh;(F’) 


Here, y is a non-negative function, supported on a Carleson cube Q;,, where Jo € 
J* := Urer J*(F). We construct stopping intervals G for y, by initializing G = 
{Jo}, and setting a;,(g) = © On y. In the recursive step, for minimal J € G, we add 
to G the maximal subintervals J’ ¢ J with J’ © J* such that a,(J') := 10V > 
10a,(J). We let zg/ be the minimal element of G that contains /. 

Now, in the sum (39), a given interval J that occurs satisfies either 7gJ C F’ 
or F’ © agJ. (Keep in mind that there could be many intervals G € G that lie 
between J and F’.) This division splits the sum into two terms, the first is the sum 
over F’ € F of 


a(F’) SY  S* PR.) i: y du. (40) 


FeCh#(F’) JE J * (F) 
mgJCF’ 
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And the second is sum over G € G of 


~ 
a a a) ec 
FCF FeChz(F’) JE 7* (F) al 
ao F’=G igi= 
F’#G 


The first testing inequality is this inequality, uniform over F’ € F. 


1/2 
40 a Po(F)'"| » o@Fn(00) (42) 


GEG 
a¢G=F’ 


That this completes the bound of (40) is an immediate consequence of quasi- 
orthogonality. By Cauchy-Schwarz applied to the right of (42), note that 


Y aFPOF) SIAR. 


FeF 


and as well, by the construction of the stopping data for y, 


Ye a (GP n@e) § Iv 
FEF GEG 
a¢G=F’ 


This completes half of the proof of (39). The other half follows from the second 
testing inequality: Uniformly in G € G, there holds 


1/2 
5 ay Gn(00)"| » ont Fatt)| : (43) 


FleF 
ag F’=G 


It is bounded again by quasi-orthogonality. It remains to prove the two testing 
inequalities (42) and (43). 


Proof of (42) This is just the energy inequality. By construction 
(40) Sa(F) Do SY) ay (agJ)Po(F’ \ F. Dees, Dp). 


FeChs(F) Je J* (F) 
mgJCF’ 


Of course we use Cauchy-Schwarz on the right above. Recall the definition of jz, to 
see that this inequality 
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DYE Polk \ FP eG, WDD) 
FeCh#(F) Je J*(F) 
ag JCF’ 


Yo > PEE IP DD ay, S ICo(F) 


FECh#(F) JE7*(F) ICI 
mgJCF’ 


is simply a reformulation of the energy inequality (19). 
The other part of the application of Cauchy-Schwarz is 


Yo DE ere uk, WD) S Yo a (tg J)? ue). 
FeChs(F) Je J* (F) GEG 
mgJCF’ ae GE{F }UCh#s(F’) 


This completes the proof of (42). Oo 


Proof of (43) In (41), we dominate toe y du < 10 EGY - w(Q,;), and then 
express (41) using the dual to the operator P defined in (36). We have 


(41) SEbyx So oF) So SS Pp. Dus) (44) 
FleF FeECh#(F’) JE J *(F) 
1g F’=G agJ=G 
F’A4G 
= toy xf Yo SO oF) Fe SS PH(Q)) do (45) 
aree FECh#(F’) JET*(F) 
tgJ=G 
ee 


Apply Cauchy-Schwartz in the variable F’, and L?(o). One of the terms that result is 


y a (F’)°o (F’). 
FleF 
agF’=G 


Compare to the right side of (43). The other term is the following inequality, holding 
uniformly in G € G: 


2 
[ oy a 2 Fy-Fucon| do SAzu(Qe). (46) 


ae FECh#(F’) JET * (F) 
1gk’= tgJ=G 
hee 


As the inequlaity shows, this follows from the Az condition. 

An obstacle to a proof is that the sets Q; overlap. This is addressed with the 
definition W, = J x (277!, 27], for j > 0. These sets are disjoint in j and J. We 
will then show that for each F’ € F, 
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2 
al Se Fy Faw) dos27A, SY > > u(W). (47) 


FeCh#(F’) JE J *(F) FeCh#(F’) JES * (F) 
agJ=G agJ=G 


This easily implies (46). 

Two additional summing variables are convenient. For integers k > r, we restrict 
the sum to J € J*(F) with 2*|J| = |F|. And, for integers £ > k(1 — €), we further 
require that 


2 J| < dist(J, OF) < 2°|J]. (48) 


By goodness, £ > k(1 — €), but there is in general no other condition that we 
have here. Then, we prove this estimate, which is (48), with these two additional 
restrictions on J. Uniformly in F’ € F, 


2 
[ > » re Fv do S27 "A, YY uM). 


FeChe(F) JE Z*(F) FeChF(F") JET*(F) 
ee 1gJ=G 
2*|J|=IFI, (48) holds 


(49) 


In (49), there are at most 2° intervals J. We can therefore pass the square inside 
the sum, at cost of a factor of 2°. But, 


[ PLOW)? doa) S n(W}) [ Hp deo)do(s) 50) 


1 
i b3 + le-yil 


(Wi)? ~ 


S27 in (Wi)w)Po (Fed) S 277 Aau(Wi). (52) 


Here, we have used Cauchy-Schwartz, followed by the estimate below, which holds 
for x € Fr, 


u(W4) 
2 F2(\I|2 + |x —xy/2) 


1 
=e CO 
wi bys + lx—-m PP 
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Then, besides disjointness, the sets W, enjoy the estimate .(W) < 2-7/|JP-w(), 
which follows from the definition of jz in (37), and the estimate |x, |? < |J|?w(J). 
Finally, we just appeal to the A condition. The bound in (52) is multiplied by 2°, 
to prove (49). This finishes the proof. Oo 


Context and Discussion 


The inequality (38) is universal. This was first proved in [23], in the case that the 
weights did not share a common point mass. It was down by appealing to the Sawyer 
theorem [53] on two weight inequalities for the Poisson operator. This technique 
does not allow common point masses, however. Addressing this, Hyténen [15] 
found a clever way to use dyadic approximates to the “Poisson operator with holes,’ 
by using dyadic approximates to an arbitrary interval, and prinving a novel dyadic 
two weight inequality. 

The proof herein does not attempt to prove the universal form of (38). Indeed, 
this inequality is not needed. Indeed, the close relationship between the function f, 
and the derived measure jz in (37) permits a short self-contained proof. 


The Stopping Form 


The last step in the proof of Theorem 4.2, hence in the proof of the main theorem, 
is to show that the local inequality (32) holds. Using the discussion at the end of the 
previous section, this amounts to controlling the stopping form. Given an interval 
F ¢ Ff, the stopping form is 


Be’.a= YY DY EARS (Ho(lo— 11), Ai )w. (53) 


Iin¢l=F J: J€El,a¢I=F 


Lemma 6.1 There holds for each F € Ff, 


[Br (Ff. 8)1 S HIP Pf llo lOve lw. (54) 


The stopping form arises naturally in any proof of a T1 theorem using Haar or 
other bases. In the non-homogeneous case, or in the Tb setting, where (adapted) 
Haar functions are important tools, it frequently appears in more or less this form. 
Regardless of how it arises, the stopping form is treated as a error, in that it is 
bounded by some simple geometric series, obtaining decay as e. g. the ratio |J|/|/| 
is held fixed. (See for instance [38] (7.16).) 

These sorts of arguments, however, implicitly require some additional hypothe- 
ses, such as the weights being mutually Ago. Of course, the two weights above can 


Two Weight Hilbert 43 


be mutually singular. There is no a priori control of the stopping form in terms 
of simple parameters like |J|/|Z|, even supplemented by additional pigeonholing of 
various parameters. 

Our method is inspired by proofs of Carleson’s Theorem on Fourier series 
[5, 10, 25], and has one particular precedent in the current setting, a much simpler 
bound for the stopping form in [22]. 


Admissible Pairs 


We can assume that f = PZf and g = Qe. For all pair of intervals J € J C F 
that we need to consider, we have z7J = F, and hence by the Energy Stopping 
condition, there holds 


P(o(F —1y), 1)E(w, 1))°wUy) < Co(1y). (55) 


For if not, by monotonicity (16), we would have that the interval J; would be an 
energy stopping interval, hence J; € F, and J = 17. It is this condition that is our 
starting point for the recursion. 

A range of decompositions of the stopping form necessitate a somewhat heavy 
notation that we introduce here. The individual summands in the stopping form 
involve four distinct intervals, namely F’, 7,77, and J. The interval F will not change 
in this argument, and the pair (/,/) determine /;. Subsequent decompositions are 
easiest to phrase as actions on collections Q of pairs of intervals Q = (Q,, Q2) with 
F D> Q; D Q». (The letter P is already taken for the Poisson integral.) And we 
consider the bilinear forms 


Bo(f,8) = )- Et, Abif : (Ho(F — (Q1)o,), A$,8)v- 
QEQ 


We will have the standing assumption that for all collections Q that we consider are 
admissible. 


Definition 6.2 A collection of pairs QO is admissible if it meets these criteria. For 


any Q = (Qi, Qo) € Q, 


(1) Q2 € Q; C F, and both Q; and Q> are good. 
(2) (convexity in Q,) If QO” € Q with OF = Q) and Of C I C Qi, with I good, 
then there is a Q’ € Q with OQ = J and Q) = Qp. 


The first property is self-explanatory. The second property is convexity in Q), 
subject to goodness, holding Q> fixed, which is used in the estimates on the stopping 
form which conclude the argument. A third property is described below. 
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We exclusively use the notation Q;, k = 1,2 for the collection of intervals 
U{Q. : Q € Q}, not counting multiplicity. Similarly, set Qi = {(Qi)g, : Q € Q}, 
and Q; := (Q:)g,- 


(3) Every interval Q, € Q, satisfies 77Q,) = F (And so, every oO; has F-parent F.) 


The last requirement comes from the assumption that the functions f and g be 
adapted to Fenergy (F’). We will be appealing to different Hilbertian arguments below, 
so we prefer to make this an assumption about the pairs rather than the functions /, g. 
The Hilbert space will be the space of good functions in L?(o) and L?(w). 

Typically, one only ever needs goodness of the small interval, in this case Qo. 
We will use the term size(Q) below, in which it will be apparent that goodness of 
the intervals Q; will be helpful. Namely, at this point goodness is used to as in the 
monotonicity principle, to estimate off-diagonal inner products involving the Hilbert 
transform by Poisson averages, and to regularize Poisson averages. Both are made 
more explicit in section “Upper Bounds on the Stopping Form”. 

The stopping form is obtained with the admissible collection of pairs given by 


Oo = {U, J) : J €1C F,land J are good, z7J = F}. (56) 


There holds Be (f,g) = Ba (f,g). 
There is a very important notion of the size of Q. 


. P(o(F — K), K)* . 
size(Q)? := sup roe a > Cala ee (57) 
KEQ\UQ> o( )| | JEQ.:ICK 


For admissible Q, there holds size(Q) < H, as follows (55). 
More definitions follow. Set the norm Bg of the bilinear form Q to be the best 
constant in the inequality 


IBa(f.g)| < Bollfllellgllw. 


Thus, our goal is show that Bg < size(Q) for admissible Q, but we will only be 
able to do this directly in the case that the pairs (Q,, Q2) are weakly decoupled in 
a collection Q. The relevant decoupling is precisely described in section “Upper 
Bounds on the Stopping Form”. 

Say that collections of pairs Q/, for j € N, are mutually orthogonal if on the one 
hand, the collections (Q/)», of second 1 coordinates of the pairs, are pairwise disjoint, 
and on the other, that the collections(Q/), are pairwise disjoint. The concept has to 
be different in the first and second coordinates of the pairs, due to the different role 
of the intervals Q; and Q>, which comes up again in the next paragraph. 

The meaning of mutual orthogonality is best expressed through the norm of the 


associated bilinear forms. Under the assumption that Bg = > jen Bay, and that the 


{Q’ : j © N} are mutually orthogonal, the following essential inequality holds. 
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Bo < V2supBo). (58) 
jeN 


Indeed, for j € N, let I;” be the projection onto the linear span of the Haar functions 


{hy i Je Of}, and use a similar notation for I1?. We then have the two inequalities 


Dlayells, < Iisik. = DOMTPAIIS < 211. 


jeN jeN 


Since a given interval J can be in two collections fay , we have the factor of 2 in the 
second inequality. Therefore, we have 


IBolf.a)l < > |Baif.s)| 
jeN 
= )>|Bo:(1?f, 11"g)| 
jen 
< So Boill M/F llol|T'sllw < 2By Wille llgllw- 
j€ 


jeN 


This proves (58). 


The Recursive Argument 


This is the essence of the matter. 


Lemma 6.3 (Size Lemma) An admissible collection of pairs Q can be partitioned 
into collections Q!@"8¢ and admissible Ove fort € N such that 


Bo < Csize(Q) + (1 + V2) sup Bosmat, (59) 
£ 


and sup size(Q?""") < isize(Q). (60) 
teN 


Here, C > 0 is an absolute constant. 


The point of the lemma is that all of the constituent parts are better in some way, 
and that the right hand side of (59) involves a favorable supremum. We can quickly 
prove the main result of this section. 


Proof of Lemma 6.1 The stopping form of this Lemma is of the form Bo(f, g) for 
admissible choice of Q, with size(Q) < CH, as we have noted in (56). Define 


f(A) := sup{Bg : size(Q) < CAH}, 0<A <1, 
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where C > 0 is a sufficiently large, but absolute constant, and the supremum is 
over admissible choices of Q. We are free to assume that Q; and Q, are further 
constrained to be in some fixed, but large, collection of intervals Z. Then, it is clear 
that ¢(A) is finite, for all 0 < A < 1. Because of the way the constant H{ enters 
into the definition, it remains to show that (1) admits an absolute upper bound, 
independent of how Z is chosen. 

It is the consequence of Lemma 6.3 that there holds 


f(A) < CA+ (14+ V2)E(A/4), b=) <1: (61) 


Iterating this inequality beginning at A = 1 gives us 


(I) <C+ (1+ V2)0(1/4) ss CD [4] <a, 


ll 
i) 


So we have established an absolute upper bound on (1). Oo 


Proof of Lemma 6.3 


We restate the conclusion of Lemma 6.3 to more closely follow the line of argument 
to follow. The collection Q can be partitioned into two collections ol@tse and Qsmall 
such that 


(1) Boje S t, where t := size(Q). 

(2) Qsmall = ore U oo, 

(3) The collection Gamal is admissible, and size(Qs™") < a 

(4) For a collection of dyadic intervals £, the collection gout is the union of 
mutually orthogonal admissible collections Os for L € £, with 


size(Qsmal! < LEE. 


T 
4 
Thus, we have by inequality (58) for mutually orthogonal collections, 
Bo s Bolarse + B oysmatly gymal 

< Bolarge + B oma + B oma 

< CT + ad + V2) max {B gsmall i. ee B sma \ é 
This, with the properties of size listed above prove Lemma 6.3 as stated, after a 
trivial re-indexing. 


In a manner similar to the argument of section “Global to Local Reduction”, there 
is an induced measure on the upper half-plane that is relevant to our considerations. 
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This time it is given by 


bo=hs= yy (x, Wey? Sccs,1a)s x, is the center of J. 
JEQ2: JCF 


The tent over L is the triangular region T;, = {(x, y) : |x —.x,| < |L| — y}, so that 


wT) = > (x, hy). 


JEQQ: ICL 
Observe that 


P(o(F — K), K)’ 


: Os 
ee o(K)IKP 


KEQ\UQ) 


[(Tk). 


All else flows from this construction of a subset £ of dyadic subintervals of F. 
The initial intervals in £ are the minimal intervals L € Q; U Q> such that 


P(o(F —L),L)’ a 
——_——_w(Tr) = —o(L). 62 
me) = Tg) (62) 
Since size(Q) = T, there are such intervals L. 7 
Initialize S (for ‘stock’ or ‘supply’) to be all the dyadic intervals in Q; U Q) 
which strictly contain some interval in £. In the recursive step, let L’ be the minimal 
elements S € S such that 


WTs)=— YI wT). p= (63) 


LEL: LCS 
Lis maximal 


(The inequality would be trivial if p = 1.) If £’ is empty the recursion stops. 
Otherwise, update £ < LUL',andS «+ {KES : K EG LVL € CL}. See 
Fig. 7. 

Once the recursion stops, report the collection £. It has this crucial property: For 
Le £, and integers t > 1, 


Yd) ev) <p u(T1). (64) 


Ln L/SL 


Indeed, in the case of t = 1, is acriteria for membership in £, and a simple induction 
proves the statement for all t > 1. 

The decomposition of Q is based upon the relation of the pairs to the collection 
L£, namely a pair Or, Q> can (a) both have the same parent in £; (b) have distinct 
parents in £; (c) Q> can have a parent in L, but not O71; and (d) Q> does not have a 
parent in £. 
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Fig. 7 The shaded smaller tents have been selected, and T, is the minimal tent with j.(7z) larger 
than p times the jz-measure of the shaded tents 


A particularly vexing aspect of the stopping form is the linkage between the 
martingale difference on g, which is given by J, and the argument of the Hilbert 
transform, F — I;. The ‘large’ collections constructed below will, in a certain way, 
decouple the J and the F — J;, enough so that norm of the associated bilinear form 
can be estimated by the size of Q. 

In the ‘small’ collections, there is however no decoupling, but critically, the size 
of the collections is smaller, and we only have to estimate the maximal operator 
norm among the small collections. 


Pairs Comparable to £ 


Define 
Or, :={2€9: mcQ1 = 2x-Q. = Li, Lely ten. 


These are admissible collections, as the convexity property in Q), holding Q» 
constant, is clearly inherited from Q. Now, observe that for each t € N, the 
collections {Q;,, : L € £} are mutually orthogonal. The collection of intervals 
(Q7.1)2 are obviously disjoint in L € CL, with t € N held fixed. And, since 
membership in these collections is determined in the first coordinate by the interval 
O; , and the two children of Q; can have two different parents in £, a given interval J 
can appear in at most two collections (Q;,,,);, as L € £ varies, and t € N held fixed. 
Define of to be the union over L € £ of the collections 


my = {Q€ On : Q FL}. 


Note in particular that we have only allowed t = 1 above, and QO; = Lis not 
allowed. For these collections, we need only verify that 
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Lemma 6.4 There holds 


size(Q3ma") < /(p—l)-t= - LEL ten. (65) 


Proof An interval K € (om"), U Q> is not in £, by construction. Suppose that K 
does not contain any interval in £. By the selection of the initial intervals in £, the 
minimal intervals in OF; U Q» which satisfy (62), it follows that the interval K must 
fail (62). And so we are done. 

Thus, K contains some element of £, whence the inequality (63) must fail. 
Namely, rearranging that inequality, and using the measure ju associated with Os 


Howat(Tr) <(p-1) D2 wT) (66) 
eae : hoes 


e |K|? -o(K) 


67 

aul T) S16 Pio(L— KK ee 

Here, note that we begin with the measure Hgsmat; USE Pp = l+ t and the last 
inequality follows from the definition of size. This finishes the proof of (65). oO 


The collections below are the first contribution to Q!"8°. Take one = = Uf ores : 
Le L}, where 


na ={QEQ1: 01 = L}. 


Note that Lemma 6.8 applies to this Lemma, take the collection S of that Lemma 
to be {L}, and the quantity 7 in (79) satisfies n < t = size(Q), by (82). From the 
mutual orthogonality (58), we then have 


B gi S < /2 sup B ee 


large SS 
LEL Qn 


rge 
2 


The collections Q;,, for L € £, and t > 2 are the second contribution to Q 
namely 


race mo =U S| Os 


LEL t>2 


For them, we need to estimate Bo, ,. 


50 M.T. Lacey 


Lemma 6.5 There holds Bo, , < p~/*t. 


From this, we can conclude from (58) that 


B oisee < > Busou, :LEL} 


t>2 


< V2) | sup Box, <a Lace 3 


12 t>2 


Proof For L € CL, let S;, the £-children of L. For each Q € Q,;, we must have 
Qo C Ms5,Q2 C OQ). Then, divide the collection Q;,, into three collections OF a 
£ = 1,2,3, where 


Or, :={Q€ 1 : Oo © 15,Qr}, 
Qi, ={Q€ Or, : Qo € Ts5,Q2 € Oi}, 


and Q} = Q,,—(Q;, U Q7,) is the complementary collection. Notice that Q7 , 
equals the whole collection Q,,, fort > r+ 1. 

We treat them in turn. The collections Qi; fit the hypotheses of Lemma 6.8, just 
take the collection of intervals S of that Lemma to be Sr. It follows that Bo! < 


B(t), where the latter is the best constant in the inequality 


Y>  P(o(F-K),J) Jee 


JE(Qr1)2:JEK 


Ub a < B(t)’o(K), KeS,,LeELl, t>2. 

(68) 
We will prove the estimate below, which is clearly summable in ¢ € N to the estimate 
we want. 


Lemma 6.6 There holds B(t) < p~'/*t. 


Proof We have the estimate without decay in 1, B(t) < size(Q), as follows 
from (82). Use this estimate for 1 < t < r+ 3, say. In the case of t > r+ 3, 
the essential property is (64). The left hand side of (68) is dominated by the sum 
below. Note that we index the sum first over L’, which are r + 1-fold £-children of 
K, whence L’ € K, followed by t — r — 2-fold £-children of L’. 


» » Y> P(o(F—K).J) ap a. (69) 


Viel L’eL = JEQzd: JCL” 
nit W=K mt SL 


™ P(o(F —K),L'? 

2 Tae Ht) ~~ 70) 
L'eL 

wot V=K wet L SL 


r 
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4) P(o(F —K),L')? 
< p t+r42 2 ROME NE REE 


Ty 71 
we M(T1) (71) 
Vel 
wot =K 
Sot? YT of) S Ppa). (72) 
Vel 
wt y=K 


We have also used (77), and then the central property (64) following from the 
construction of L, finally appealing to the definition of size. Hence, B(t) < tp~"/”. 
This completes the analysis of Os Oo 


We need only consider the collections oF. for 1 < t<r+1, and they fall under 
the scope of Lemma 6.9. A variant of (82) shows that Bo: < t. Similarly, we need 
only consider the collections oO . for 1 < t < r+ 1. It follows that we must have 
2" < |O,|/|Q2| < 27. Namely, this ratio can take only one of a finite number of 
values, implying that Lemma 6.10 applies easily to this case to complete the proof. 

oO 


Pairs Not Strictly Comparable to C 


It remains to consider the pairs Q € Q such that Q, does not have a parent in £. The 
collection oe is taken to be the (much smaller) collection 


oom = {Q € O : Q> does not have a parent in £}. 


Observe that size(Qs™") < V(e— 1)t < §. This is as required for this collection. 
(The collections al and oa are also mutually orthogonal, but this fact is not 


needed for our proof.) 


Proof Suppose 7 < size(Qs™"). Then, there is an interval K € (om) U(gsml), 
so that 


P(o(F — K), K)* 


2 
K)< 
no(K) < Kp 


HL gym (Tx). 
Suppose that K does not contain any interval in £. It follows from the initial intervals 
added to L£, see (62), that we must have n < af 


Thus, K contains an interval in £. This means that K must fail the inequality (63). 
Therefore, we have 


P(o(F — K), K)* 


no (K) < (p— 1) KE 


2 
y(Tk) < 0). 


This relies upon the definition of size, and proves our claim. Oo 
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For the pairs not yet in one of our collections, it must be that Q, has a parent 
in £, but not Q;. Using £*, the maximal intervals in £, divide them into the three 
collections 


38 = {QE Q : Or © mexQr C Oi}, a 
om — {0 EO % QO, a Te*Or E Or}, (74) 
apa ={Q€Q: OQ ¢ me*Qr € Oi, and me*Q) ¢ O}}. (79) 


Observe that Lemma 6.8, with (82), gives 
ange. 
Bowne St. (76) 


Take the collection S of Lemma 6.8 to be £*. 

Observe that Lemma 6.9 applies to show that the estimate (76) holds for Q 
Take S of that Lemma to be £*. The estimate from Lemma 6.9 is given in terms of 
n, as defined in (94). But, is at most Tt. 

In the last collection, Q!"**, notice that the conditions placed upon the pair 
implies that |Q;| < 27’+7|Q)|, for all @ € O%"**. It therefore follows from a straight 
forward application of Lemma 6.10, that (76) holds for this collection as well. 


large 
4 3 


Upper Bounds on the Stopping Form 


We prove upper bounds on the norm of the stopping form in a situation in which 
there is some decoupling between the martingale difference on g, and the argument 
of the Hilbert transform. First, an elementary observation. 


Proposition 6.7 For intervals J C L © K, with L either good, or the child of a 


good interval, 


P(o(F—K),J) _ P(o(F —K).L) 
\J| ~ [Z| , 


(77) 


Proof The property of interval J being good, says that if J C 7, and 2’—!|/| < |i], 
then the distance of either child of / to the boundary of / is at least |/|*|Z|!~*. Thus, 
in the case that L is the child of a good interval, the parent L of Lis contained in K, 
and 2’! |Z < |K|, so by the definition of goodness, 


dist(J, F — K) > dist(L, F — K) 
Sf n 228), 


The same inequality holds if L is good. Then, one has the equivalence above, by 
inspection of the Poisson integrals. Oo 
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Lemma 6.8 Let S be a collection of pairwise disjoint intervals in F. Let Q be 
admissible such that for each Q € Q, there is an S € S with Qy © S C Q,. Then, 
there holds 


IBo(f.g)| S nllfllellgllw. (78) 
1 
where 1° := sup —~ P(o(F — 8), J)*(— ye (79) 
seb (5) es iy 


It is useful to note that 7 is always smaller than the size: For S € S, let 7* be the 
maximal intervals J € Q» with J € S, and note that (77) applies to see that 


>> P@(F-S),s{ ee =) YY) POf-S), anit a. 


JEQ, JES J*ET* JEQ: ICI* 
(80) 
P(o(F — 5S), J*)? rr 
= > FPR » (x, hy), 
Jees* JEeQ:Ics* 
(81) 
S YO a") S size(Q)o(S). (82) 
J*ET* 


Proof An interesting part of the proof is that it depends very much on cancellative 
properties of the martingale differences of f. (Absolute values must be taken outside 
the sum defining the stopping form!) This argument will invoke the stopping data, 
and part of the Hilbert-Poisson exchange argument. 

Assume, as we can, that the Haar support of f is contained in Q). Take F and 
a(-) to be stopping data defined in this way: First, add to F the interval F, and 
set a(F) := EZ|f|. Inductively, if F € F is minimal, add to F the maximal 
children F” such that a(F’) = EZ,|f| > 4ay(F). This is a simple form of the 
stopping data construction in section “Global to Local Reduction”. In particular 
quasi-orthogonality (27) holds. 

Write the bilinear form as 


Bo(f.g) = > (Hoos, AV 8)w (83) 
J 
where gy:= . EVAQf-(F-Q)). (84) 
QEOD: Q=J 


The function @, is well-behaved, as we now explain. At each point x with g(x) 4 0, 
the sum above is over pairs Q such that Q2 = J andx € F— O1. By the convexity 
property of admissible collections, the sum is over consecutive (good) martingale 
differences of f. The basic telescoping property of these differences shows that the 
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sum is bounded by the stopping value a(zzJ). Let I * be the maximal interval of 
the form Q, with x € F — Qj, and let J, be the child of the minimal such interval 
which contains J. Then, 


la@l=| So EFAg,F09| 


Qo :Qo=J 
erie) (85) 


= |E7.f —EZf| S 


S af (azJ)(F — S), 


where S is the S-parent of J. 
We can estimate as below, for F € F: 


B(F)=| D> Bg, AQF: (Ho(F - 01), A¥8)w (86) 
Q€Q:nzQ=F 
2] Hoen Aie)n (87) 
JE€Q2:0¢J=F 
'S a/(F) >> >) PoO-S), MNT 7 Ava), | (88) 
SES JEQ) 
uFS= F JCS 
1/2 
SaA| YO Pow). F. ny), x Yo aur] 9) 
SES JEQ? | |’ JEQ? 
nu¢S=F JCS usJ=F 
(79) 1/2 
S size(Qha(F)| S> o(S)x YY aay? (90) 
SES JEQ, 
uFS=F uFJ=F 
< size(Qhay(F)o(F)'/?| - ay] (91) 
JEQQ. :n¢-J=F 


The top line follows from (84). In the second, we appeal to (85) and monotonicity 
principle, the latter being available to us since J C S implies J € S, by hypothesis. 
We also take advantage of the strong assumptions on the intervals in Qo: If J € Qo, 
we must have m7J = 2#(ssJ). The third line is Cauchy—Schwarz, followed by the 
appeal to the hypothesis (79), while the last line uses the fact that the intervals in S 
are pairwise disjoint. 

The quasi-orthogonality argument (27) completes the proof, namely we have 


Y= E(F) S size(Q)Ilflollgllw- (92) 


FeF 
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Lemma 6.9 Let S be a collection of pairwise disjoint intervals in F. Let Q be 
admissible such that for each Q € Q, there is an S € S with Qy C S © Q\. Then, 
there holds 


IBo(f.s)| S nllfllellglly. (93) 
P(o(Q; — 26.8), 8)? 
where 1° = sup (o@ = by Co ee (94) 
SES o(S)|S| JEQn ICS 


Proof Construct stopping data F and af(-) as in the proof of Lemma 6.8. The 
fundamental inequality (85) is again used. Then, by the monotonicity principle (17), 
there holds for F € Ff, 


EB) =| D> BAGS: (Ho(F- 01), AB, 8) 
Q€O:nF-Q=F 
x : A 
Sa(F) > Po(F-76,5).5) (Tgp a BOO 
SES :n-S=F JEQ) ICS 
* w\2 = M2 
SoM Do Plo —7a,9.9" Yo (omix Yo aU? 
SES: n¢S=F JE Qs: ITS JEQ, ICS 


SnaFf Yo ox Ye aay] 


SCS: 17-S=F JEQ7 ICS 


sna Mory'[ > avy)!” 


JEQ, 11 -I=F 


After the monotonicity principle (17), we have used Cauchy-Schwarz, and the 
definition of 7. The quasi-orthogonality argument (27) then completes the analysis 
of this term, see (92). oO 


The last Lemma that we need is elementary, and is contained in the methods 
of [38]. 


Lemma 6.10 Let u > r+ 1 be an integer, and Q be an admissible collection of 
pairs such that |Q,| = 2"|Q>| for all Q € Q. There holds 


IBo(f.g)| S size(Q)|lfllollgllw- 


Proof Recall the form of the stopping form in (53). Observe, from inspection of the 
definition of the Haar function (10), that 


f(D) 
o(1;)'/2" 


IE, Arfl s 
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Then, an elementary application of the monotonicity principle gives us 


x 


Bof.al< FO! > oly PPO F—1).Di 7 


TEQ) J: UJ)EQ 


I 2491/2 
< flo s ss sap Pal — WAH), eI | 


1EQ, + J: (LNEQ 


hy) 1@J)| 


S size(Q)|lfllo Il lhw 


This follows immediately from Cauchy-Schwarz, and the fact that for each J € Qo, 
there is a unique J € Q, such that the pair (/, J) contribute to the sum above. Oo 


Context and Discussion 


The proof herein succeeds because the notion of size approximates the operator 
norm of the stopping form. Moreover, the ‘large’ portions of the stopping form, 
there is a decoupling that takes place. 

It is very interesting that one can prove unconditional results about the two 
weight Hilbert transform, following the techniques in [23], without solving the local 
problem. 


Elementary Estimates 


This section is devoted to the proof of Lemma 4.1. The estimates fall into many 
subcases, and are of a more classical nature, albeit the Az assumption is critical. (In 
fact, all the estimates in this section depend only on the half-Poisson Az hypothesis, 
but this is not systematically tracked in the notation.) In addition, all estimates 
should be interpreted as uniform over all smooth truncations. Some of these are off- 
diagonal estimates, for which the smooth truncations are important. The uniformity 
over truncations is however suppressed in notation. 

First some basic estimates are collected. This is property of good intervals, which 
can be effectively used in non-critical situations. 


Lemma 7.1 For three intervals J,I,I' € DwithJ CIC I, |J| = 2-*|I|, with 
s > rand J good, then 


P(o- (I! —D,J) < 2-9 -’P(o - 1’, I). (95) 
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Proof Note that for x € I’ — I we have 
dist(x, J) > ||! @|7/* = 20-97]. 


Using this in the definition of the Poisson integral, we get 


/ lJ] 
P(o-( —I),J) <2 ——. o(d. 
@-U-D)s I ane 
|J| [Z| 
d. 
STH Jes WF diste ye 
1| 
< as | | dx) = 27 s(1—2e) p T’ _] JI ; 
‘ yi (dist, ye as 
| 
Proposition 7.2 Suppose that two intervals I,J € D satisfy |I| => |J|, and 310 J = 
@, then 
sup (H(o1).F)6| So yw (96) 
0<a<B (| - dist(I, Jy? 


Proof Since h’; has w-integral zero, estimate as below, where x, is the center of J. 
HL Hil = | / / Ka ply x) “3 (a) w(dsdo(ay)| 
TJJ 
=| ff aso) Kool an}R5 6) wld otay) 


“lo 
S) cae (|J| + dist(/, J))? |Ay; (x)| w(dx)o (dy). 


The Lemma follows by inspection. Oo 


Proposition 7.3. Suppose that two intervals I,J € D satisfy 2°|J| = |I|, where 
5 > 1, the interval J is good, and J C 31 \ I, then 


sup |(H(a1), hy) ww] S 2-9 oD) Jw! (97) 


0<a<B 


Proof Under the assumption of the Lemma, the proof of Proposition 7.2 holds, 
supplying the estimate estimate of that Lemma. But, the extra assumption that J is 
good implies that dist(J, 7) > 2°'~®)|J|, and then the estimate follows by inspection. 

Oo 
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The Weak Boundedness Inequality 


The following inequality is a weak-boundedness inequality, a consequence of the 
Az inequality. Here, we look at the Hilbert transform inequality on two disjoint 
intervals. 


Proposition 7.4 There holds for all disjoint intervals I, J with no point masses at 
their endpoints, 


sup |(H(of -D.g-J)wl SAY Ulfllollgllw- (98) 


0<a<B 


The constant on the right can in fact be taken as follows. For a point a that separates 
the interiors of I and J, with I to the left of a, 


(a—r,a) 


MOET + POA eo) (a.a +1) 22". (99) 


Ww 
sup P(O1(~c0,a); (a, a+ r)) 
r>0 


In particular, for arbitrary intervals I and J with no point masses at the endpoints, 
2 
(Hol. J)vl S Ay "to Dw]? (100) 


It is useful to note that the global integrability of indicators is then a consequence 
of the A, and interval testing conditions. 

Since the intervals are disjoint, there is no possibility of cancellation in the 
estimate, and it therefore is closely relate to the Hardy inequality. In the two weight 
setting, this has been characterized by Muckenhoupt [32]. 


Theorem F For weights W and o supported on R. 


| [fom], < 30. (101) 

(0.x) ” 

where B* ~ sup / w(dx) x / o(dy). (102) 
0<r<oo J (r,co) (0,r) 


For the sake of completeness, we recall Muckenhoupt’s proof of this result. This 
preparation is proved by integration by parts. 


Proposition 7.7 Let ¢ be an increasing function on (0, 00), with @(0) = 0 and ¢ 
strictly positive on (0,00). Then, 


i b()"2dp(s) < 24a)", (103) 
(0,x] 


with equality if @ is continuous. 
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Proof of Theorem F We are free to assume that the function ¢(x) = o((0,x)) is 
strictly positive on (0,00). Then, multiply and divide by ¢(x)!/*, and use Cauchy— 
Schwarz to see that 


| - / FO)6(0)"? o(dy): / $60)-"? o(dy) W(dx) 
(0,x) v (0,00) J (0,x) (0.x) 


<2 / FO) 60)! o(dy) -6()"? Wax) 
(0,00) 4 (0.x) 
= 1/2 1/2 
2f) soon"? | geo!” wd ota) 


Above, we have used (103), and then Fubini. Concentrate on the inner integral. Our 
definition of B and Proposition 7.7 gives us 


1/2 
I. i wa] w(dx) < 2B he wa] 


And, now we can estimate 


1/2 


1/2 
| Le a(d)|, = 48 [oy ovo0| [. wt o(dy) < 48°F. 


The proof is complete. oO 


Proof of Proposition 7.4 Interval testing and (98) prove the estimate (100), so we 
turn to the proof of (98). 

After a translation, we can assume that 0 separates the interiors of J and J. Let us 
assume that J is to the left of zero. We change the problem. Set 0 (dx) = o(—dx) for 
x > 0, and for f € L?(I,o), set (x) = f(—x). Then, 


(Hof. 8)w =[ LOB) 5 (ayia) (104) 
00,0) (0,00) Y-* 
--| i; POE) 5 ayyw(dr). (105) 
(0,00) (0,00) *+Y 
The double integral is split into dual terms, one of which is 
/ PO)EO) 5 (ay) (ds). (106) 
(0,00) J(0,x) XY 


We analyze this bilinear form. 
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Note that x + y ~ x in (106). Thus, it suffices to estimate 


[.. | oe #0) aay) w(dx) = tes | i o) aa) mu < Bil. 


Xx xX 


where 8 is as in (102), and w(dx) = pe ando =o 
It remains to estimate the constant B, For any 0 < r < oo, 


/ &(dy) a” “ w(dx) S Ar. 
(0,r) 


(r,00) r (r,oo) X 


The more precise conclusion (99) can be read off from this inequality. Recall 
that (105) is split into two bilinear forms, and we have only considered one of them. 
This explains the symmetric form of (99). oO 


The Different Subcases of Lemma 4.1 


Lemma 4.1 follows from appropriate bounds on these bilinear forms, and their duals. 


Brean g) = > Ho Arf, AV) wl (107) 
102277" <|J|<|7 
3INI#G 
BY f.g):= SY (Ho ATF, AY) wl, (108) 
IJ :3103J=0 
Be (f.g):= SY) (HAP, Ald) wl. (109) 
J: 2"|J|<|1 
IC3I\I 
Reet g) = ~ | ar —1, Art (Ho I —I;), A’) wl. (110) 
IJ: JEly 


Lemma 7.8 For « € {nearby, far, close, adjacent}, there holds 


7 1/2 
B*(f,2) S Ay Ufllollgllw- 
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The Nearby Term 


One can check directly that for each interval J, with child J’, there holds |E’,A?| < 
o(’)~'/?. It then follows from (98) that |(Hjh?, h””),.| < H. And then, 


Rene. g) < KH > FDaV)| Ss Hf lle lle llw- 


1J:277 "I <|J|<I7 
3INJ#D 


The last line follows from the fact that for each J, there are only a bounded number 
of J occurring in the sum. . 
Here, and below, we will be using the notation f(J) = (f, h7)«. 


The Far Term 


We consider the case of |/| < |/|, and 31.0 3J # @. It follows that J C 3°T!7 \ 3°7 
for some integer s > 1. For an interval K, integer s > r and t > 0, consider the two 
projections 


Mf= d Aff 
Ti1C3F2K\3 AIK 
[1|=IK| 


w —_ w 
Nk 548 = y Ay8: 
J: JIC3'K 
2°|J|=|K| 


These projections satisfy, for fixed s, t, 


DEAS < WIZ. (111) 
K 
with a similar bound for Tk , ,g. Also, we need to bound 


Yo | Ho ME sf TK 5:8): (112) 


s=>r t>0 


But, using the fact that A?’ has mean zero, and the distance between the support 
of If, f and My , ,g is approximately 3’|K|, we have 


2” |K| 


| (Ho Wah s TT 528) w| = 31K 2 We sf Ii) [ks 18llz4ov) 
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z Va (3't+2K)w(3'K) 
~ 2°32(K| 

—sq-t gq 1/2 o w 
S23 ALPINE lle MITE Bll: 


Ti sf lho tT 5.18 lhe 


Since we have gained geometric decay in s, and t, and we have the inequality (111), 
we can easily complete the proof of (112). 


The Close Term 


For integers s > r, the sum below a relative length of J with respect to J. 
Applying (97), 


PD (Ho Att, AT) wl Ss gil—2e)s > FDeV)| vow) 


LJ: 25|J|={I| I,J: 25|J|=|1| I 
JO3I\I JC3I\I 
26s COd;n, VOW P 
S20 DOIG BO Vw) 
I J 2|J=|I| 
JO3I\I 


We have the geometric decay in s. Apply Cauchy—Schwarz, one term is ||f||>. The 
other term, squared, is 


o(1) : : 
ae Yo aux Yo wHSAY YO BUY < Adlielli. 
I J224|J|=|0) F223 |F| =|] IF: 2\\=|1| 

JC3I\I JC3I\I JO3I\I 


This completes the estimate. 


The Adjacent Term 


We argue as in the previous case. It is easy to see that |E7_, A7f| S FDlo(I - 
I)7/2, 

For 6 4 6’ &€ {+}, and consider the sum below, where s plays the same role as 
before. 


BRAS ole A¥e)y| S2rO Pr FEMI 
LJ: 25|J|=|I| LJ: 25|J|=|I| 
JCI+(0’ |Z) JCI+(6'|I|) 


=f1— 1/2 
2a lelele 


Vo(la)wi) 


[Z| 


The details are suppressed. 
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Context and Discussion 


The techniques of this section are all drawn from the work of Nazarov-Treil-Volberg 
[38, 56], aside from the use of the two weight Hardy inequality, which is drawn 
from [20]. 


Proof Under the Pivotal Assumption 


We prove an upper bound for a two weight inequality assuming a pivotal condition 
on a pair of weights. The setup is as follows. Let K(y) satisfy the size and gradient 
condition 


|x — y|-|VK(x, y)| + |K@x, y)| < |x’. 


We will consider the operator Tf given formally by p.v. { K(x, y)f(y) dy. In the two 
weight setting, no principal value need exist, so given two weights o, w, we consider 
the constant N’;, which is be the best constant in the inequality 


| [Kenroy o(dy)| < Nrllfle- 


Let P be the best constant in the pivotal inequality, defined as follows. For any 
interval Jj and any partition P of Jp into intervals such that neither o nor w have 
point masses at the endpoints, there holds 


D5 P(o(lo\ DD? w(D) < Po (Io). (113) 


IEP 


We also require that the dual inequality, with the roles of w and o reversed, holds. 
One can note that this inequality will hold if the maximal function satisfies the two 
weight inequality ||Mof ly S |[fll-, and its dual. 


Theorem 8.1 (Nazarov-Treil-Volberg [56]) Assume that the pair of weights w,o 
satisfy the Ay condition (3), and the pivotal conditions hold, namely P < co. Then, 


there holds Nr S Tp + Ay! ae P, where J is the best constant in the inequalities 


i \To1|? w(dx) < To), ij \T,1|? o(dx) < Tew). 


We give the proof, with the goal of highlighting some of the difficulties that one 
must face in the general case. In addition, a quantitative higher dimensional version 
of this Theorem was key to [45]. We will use Calder6n-Zygmund stopping data, to 
facilitate comparisons to the general case. This will also give an easier proof than is 
in [45, 56]. 
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Off-Diagonal Estimates 


We need a typical off-diagonal estimate, one that is far less refined than the 
monotonicity principle. 


Lemma 8.2 For all 0 < a < B, good intervals J © I, and function f is supported 
off of I, there holds 


\(Tof,g)| S Polf| 1°, DwV)' |lellw- (114) 


for any function g € L*(w), supported on J and with integral zero. 


Proof Use the standard subtraction argument to see that 


Mor. cn =| ff i G.y) ~ Kes. 9FO)RC) (a) wd) 


|x — x, 
S : i Of) g(x)| o (dy) w(de). 
sg JR\r Oy — y) 
The bound follows by Cauchy—Schwarz and inspection. q 


The Global to Local Reduction 


One need only prove that 
(To Pool + Peood8)wl S Tif lle llgllw. 


where J := Up + Ay > 4D. The set up is much like section “Global to Local 
Reduction”. We will understand that the functions f and g can be assumed to be 
good functions. In fact, f has the ‘thin’ Haar expansion in (21), and similarly for g, 
in order to reduce some case analysis below. 

In analogy to (23), define 


BPE = >) >) ELAS: (Tol, Ave)w, (115) 


I:ICIo J: JEl 


and define B>*W(f, g) similarly. Since Lemma 4.1 depends only on the Ay assump- 
tion, we have 
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Lemma 8.3 There holds 


j 7 1/2 
| (Taf. 8)w — BO” (f, 2) — BP" (f, 2)| S Ay lIfllollgllw- 


Thus, the main technical result is 


Lemma 8.4 There holds 


Brr"(F, a) S Tif llellallw- (116) 


The same inequality holds for B’*" (f, g). 
The stopping intervals are defined similarly. 


Definition 8.5 Define F, the stopping intervals, recursively by initializing ° € F, 
and in the recursive step, if F € F is minimal, add to F the maximal subintervals 
F' CF, with F’ € Dy, so that meet either of these conditions: 


f stopping EZ,|f| > Ca,y(F) := EF|fl. 
Pivotal Stopping P(o- Ip, 1)2w(1) > 10P?o (J). 


That is, we stop if either the average of f becomes too large, or, essentially, the 
pivotal quantity becomes too large. 


We use the same notation as in section “Global to Local Reduction’, and in 
analogy to Corollary 4.4, there holds 


Lemma 8.6 (The Global to Local Reduction) There holds 


Bere 8) S Tlfllellgllw: (117) 

where BE (f.g):= >. ELAS: (Tols, A¥8)w. (118) 
LJ: iz¢IGI 
Jel 


Proof This variant of the ‘Hilbert-Poisson exchange’ argument is needed. Holding 
F é€ F fixed, we sum over J with z¢J = F and / with F C J. Then, the argument 
of T, is Ir which is written as Ip = F + (/p \ F). Defining er by 


)> EZ AUS := era,(F), 


1:1DF 


these constants are bounded by a constant: |e-| < 1. Then, 


OF) =| D> DD ELAM (ToF, Ava)w 


TiIQF J: a¢J=F 


=|(rF, 32 eAzs) 


Ji tzJ=F 


< Toy(F)o(F)"?||OFe lw 


Ww | 
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This depends upon the testing assumption on 7, applied to intervals. The operator 
Ovg is the Haar projection defined at (26). Quasi-orthogonality as in (27) finishes 
the sum over F € F. 

The complementary case is that of the global-to-local reduction. But, under the 
pivotal condition there is a geometric decay along the stopping tree. For F € F, 
and integer j, let chj(F) be the j-fold descendants of F in the collection ¥. That 
is, cho(F) = {F}, and F” € F” € chj4\(F) iff F’ is the child of some interval 
Fle ch(F ). 

We will index by F € F, F’ © chi(F), and F” € ch,(F’), where j > 0. 
Using (114) and critically, Lemma 7.1, we have 


| SO BRS To lr \ F’), Ofra)w| S (PPO + (F\ F'),Fw(F" I Of sll 


1: 1FI=F 


S of(F)2°- YP (F\ F’), Fw”)? OF gllw- 


We have geometric decay in j above. Moreover, summing over F’ and F”’, we can 
appeal to the pivotal condition (20) to see that 


Yo PO: (F\ F’),F’w(F’) || Orr glhw 


F’echj+1 (F) 


1/2 
<[ Por FFE x Spal 


Fl echj41(F) Pech; 44 (F) 


1/2 
soon) » 17a 


F’ €chj+1 (F) 


Then, quasi-orthogonality is used to estimate 


1/2 
Yayo 3 10% S Iiflollellw- 


FEF PF’ ech;+1(F) 
This completes the global to local reduction. oO 
The Local Estimate 


It remains to prove the following local estimate: 


[BP (Pef. 8) S Tla(F)o(F)'”? + ||Pefllo}lgly, Org = 8, 


for then quasi-orthogonality will complete the bound on Beye (f, g). 
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In the bilinear form above, the argument of 7, is, for a pair of intervals J € J, 
I; = (F —1;) + F. Using linearity, and focusing on the argument of T, being F, we 
can repeat the argument of (34), which depends upon the fact that the averages of f 
are controlled. Below, there is an requirement that J; has F-parent F’, which we are 
free to add since Of'g = g. 


| De EL ATS «(Tok AY8)w| S Tay (F)o(F)'/? |Ig hw. 


Iin¢l=F J: n¢lj=F 


This bound follows the argument of (34), and we suppress the details. 
It therefore remains to consider the stopping form 


Br’¢f.g)= DY >) ELAS: (Too — hi), A¥a)w- 


Ii nz¢l=F J: a¢lj=F 


Lemma 8.7 For all F € f, there holds 


Bef. 8)1 S Plfllollgls. 


Proof This depends very much on the selection of stopping intervals. In fact there is 
geometric decay, holding the relative lengths of J and J fixed. Estimate for integers 
S2T, 


oT Arf . (Ts (Io = Ij), AY 8) wl 


Iinpfl=F J: IG .aFl=F 


(7|=2° [J] 
tke) er 
< PY ae POF = 10). DAY) wlBO| 
1: FI=F ge{+} o(Ie) J: J€lg.n¢l=F 
\|=2° [J] 
A 1/2 7 1/2 
<m| y> far] x| + aol 
1: 1FI=F J ICF 
1 
where Mm ‘= max sup > P(o(F — Ig), J)°w(J). 


6eft} 1:27 =F O(I9) JiJ€laelyy=F 


W=2°\J 


Here, we have used (a) used the bound |E7 A7f| < a tee (b) appealed to (114); 
(c) used Cauchy—Schwarz, together with the fact that for J € F, there is a unique J 
containing it, with length 2°|J/]. 

It remains to bound M,, gaining a geometric decay in s, and appealing to the 
pivotal condition. Return to the inequality (95), to gain the geometric decay, 
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Yi P@(F = 19), JPW) S 2-9" PG - F, Io)’ w(Io) S 2° Po (Io), 


J: J€lg, nFly=F 
l=2°|J| 


where the decisive point is that Jg has F-parent F, hence it must fail the pivotal 
stopping condition. Oo 


Example Weights 


The sharpness of the different conditions in the main theorem is the subject of the 
this section. 


Theorem 9.1 There are pairs of weights 0, w, with no common point masses, that 
satisfy any one of these conditions. 


(1) The pair of weights satisfies the full Poisson A condition, but the norm 
inequality for the Hilbert transform (1) does not hold. 

(2) The pair of weights satisfies the full Poisson Az condition, and the testing 
inequality (4), but the norm inequality for the Hilbert transform (1) does not 
hold. 

(3) The pair of weights satisfy the two weight norm inequality (1), but not the 
pivotal condition (20). 


Point (1) is a counterexample to Sarason’s Conjecture, first disproved by Nazarov 
[34]. In contrast to his argument, an explicit pair of weights are exhibited. 


The Initial Steps in the Main Construction 


Let C = —_ C,, be the standard middle third Cantor set in the unit interval. Thus, 
Co = [0, 1], Cy = [0, 4] U [§, 1], and more generally 


C= 'e {Ix,x+ 3] i: x= yes gj € {0, 2}}. 


j=l 


Let w be the standard uniform measure on C. Thus w(J) = 2~” on each component 
of C,, n € No. This is phrased slightly differently. Let K be the collection of 
components of all the sets C,. Then, for each K € K, there holds w(K) = |K| m3 

The weight o will be a sum of point masses selected from the intervals in G, 
taken to be the components of the open set [0, 1] — C. (G is for ‘gap’. See Fig. 8.) 
Consider the Hw restricted an interval G € G. This is a smooth, monotone function, 
hence it has a unique zero zg (Fig. 9). Then, the weight o is 
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HH HH HH HH ° ° ° e 


Fig. 8 The approximates to the Cantor set C on the /eft, and on the right, the gaps, namely the 
components of [0, 1] — C . The intervals on the /eft are in K, and those on the right are in G 


Fig. 9 The selection of the 
points zg and z, for a gap 
interval G. The function Hw, 


restricted to G is monotone Z 

: . G 
increasing, hence has a G 

unique zero, the point zg. The 

second point zG will be to the Zz t 


right, but a distance to the 
boundary of G that is at least 
c|G|, for absolute constant 


1 
O0<c<35 


o:= Y> sa 8:6: (119) 


GEG 


where sg > 0 will be chosen momentarily, consistent with the Az condition. A 
second measure is given by 0’ = )ogeg SG: by , where zi, is the unique point in G 
at which Hw(z,) = |G|-!+n3, See Fig. 9. 

The constants sg are be specified by the simple Az ratio 


w3G) o(G) ae 
—— - _— =2, thatis sg = 2|G|" ™. (120) 
IG| |G 


To see this, note that 
w(3G) = w(G— |G) + w(G + |G) = 2\G|*8 , 


since G + |G| are components of some C,,. With this definition, the basic facts about 
the w and o come from the geometry of the Cantor set and the relations below, 
wll) S [7] 3 , Tis triadic, 
(121) 


In2 
o(f) < [yom T is triadic, J not strictly contained in any G € G. 
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On the other hand, if J € G U K, the inequalities above can be reversed, namely 
wl) ~ |), o() ~ |, =e GUK. (122) 


The properties of these measures that we are establishing are as follows. 
Lemma 9.2 For the measures just defined, there holds 


(1) The Hilbert transform H, is bounded from L*(c) to L?(w). 
(2) The Hilbert transform H,’ is unbounded from L?(a') to L?(w), but the pair of 
weights satisfy the Az condition, and the testing conditions 


sup o/(1)"! [veoP dw < 00. 
I 


Tan interval 


Concerning point 2, the unboundedness of A, is direct from the construction 
of 0’. 


i (Hw)? do’ = )° Hw(z,)?o" (kzG}) (123) 
GEG 
= S-|GPom-20-h) = S*\G|th = 00. (124) 
GEG GEG 


There are exactly 2”~! elements of G of length 3~”, proving the sum is infinite. 


The Poisson Az Condition 


Lemma 9.3 For either weight ju € {0,0'}, the pair of weights w, ju satisfy the Az 
condition. 


Proof It suffices to check the Az condition on the triadic intervals in the unit interval. 
Let us begin by showing that for any triadic interval J e K UG, 


I 
Po,DS ae and P(w,DS 


w(3/) 
al (125) 
[7 

For then, the control of the simple A2 ratio will imply the control of the full A2 ratio. 
(For the inequality on w, the triple of the interval appears on the right, since w(/) 
can be zero if J € G.) Now, it will be clear that this argument is insensitive to the 
location of the points zg and 7%, so the same argument for o will work equally well 
foro’. 
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Let us consider o. Using (122), there holds 
o(!) yf I 
P(o,l) < —+ - o (dx 
( ) I 2 3kI\3k-1 ({I|? + dist(x, 1))? ( ) 
oo k 
eS o(/) y 0 (3°T) 
~ \T 1 3*|3*7| 
00). estates 2 OO ee 2 OO) 
s+ 3 *|3°7 | m3 SS 3°73 
I 2 IZ| 2 IZ| 


Turning to the weight w, one has 


w(31) > | [Z| 
P(w,l) < —— 
aca +2 anus UE dite ye MO 


ax 


w(3l) ‘> w(3*1) 


I 3/37] 


k=2 


31 = : 31) = i 31 
< “ep 4 S23 ti < _ ) 3k < _ a 
k=2 k=1 


The A, product P(o, /) - P(w, J) has been bounded for J € K U G. Suppose that 
T is a triadic interval that is not in these two collections. Then, J must be strictly 
contained in some gap G € G. Writing = G, where, J denotes the k-fold 
parent of / in the triadic grid, we have w(G) = 0. Hence, 


_ i east 
Pew. = fa Tardis Tye MO) = TPO.) 


First, consider o restricted to the gap G: 


P(w, I)P(o -G,D) < 3-*P(w, aa ~ Pw, oe mele 


Now, we have to consider the Poisson average of o off of the gap G, in which case 
we have 


P(o - ((0, 1] \ G),D ~ 3-*P(a, G), 


and so the estimate follows. oO 
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The Testing Conditions 


We turn to the testing conditions, using in an essential way the precise definition of 
the weight o: it gives a huge cancellation, which simplifies things considerably. 


Lemma 9.4 For any interval I, there holds 
/ |Hy1|? do < wi). (126) 
I 


Proof By construction of o, there are two reductions. The first is simple, namely 
that the two endpoints of the interval J can be taken to be an endpoint of an interval 
in G. The second comes from the construction of 0: Hw = 0, relative to do measure. 
Hence, 


[ver do = [00 1] —1)° do, 
I I 


namely the complement of I is the argument of the Hilbert transform on the right. 
Then, one abandons all further cancellations. Let us show that for all intervals 
K € K (the components of the sets C,, which generate the Cantor set), 


/ |H,,Kn|’ da < w(K), (127) 
K 


where K;,; is the right component of [0, 1] \ K. The same estimate holds for the left 
component, and this completes the proof. For, if we set J, to be the right component 
of [0, 1] \ J, and take K', K*,..., to be the maximal intervals in K contained in J, 
there holds 


Co 
/ (Hylan)? do < S> / (Hy Kn)? do 
IT n=l kn 


Sw") S wD. 
n=1 


Now, for K € K, let K,, K2,..., be the maximal intervals in K that lie to the right 
of K. Arranging them in increasing length, note that the length of K, is either |K| or 
3|K|. For n > 2, the length of K,, increases by a factor of 3, and dist(K, Kn) = |Knl, 
and hence there are at most 1 — log;|K| such intervals in K. Here is an illustration: 


K kK K) K 
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Then, one has the estimate below, where the sum is of a decreasing geometric 
series, estimated by its first term. 


oe) 


|HKul S >> 


n=1 


w(Kn) w(K) 
[Kl IKI 


Hence, (127) follows from the control of the A> ratio. oO 


An important part of the remaining arguments is that points zg, and 7 cannot 
cluster close to the boundary of G. 


Lemma 9.5 There is a constant 0 <c < 5 such that 
Ize — Zl = elGl. 
Proof Estimate Hw at the midpoint z{ of a component G. By symmetry of the 


Hilbert transform, and the Cantor set, it always holds that H(w13g)(z) = 0, so 
that appealing to (121), 


IHw(z6)| = IA (wl ey) (ZG) | 


n 
S Vista 't8 s jap tt es 
k=2 


Next, we turn to a derivative calculation. The function Hw, restricted to G is a 
smooth function, one that diverges at the end points of G at a rate that reflect the 
fractal dimension of G. For any x € G note that 


d w(dy) 

—_H = 

a | Gam? 
> WOO) gtk, 
me GIP 


This is a uniform lower bound, and in fact the lower bound is very poor at the 
boundaries of G. Indeed, 


ad In 2 
Hw(x) > dist(x, 9G)~2+ 3, 
dx 


It follows that we have to have |zg — z,| < c|G|, for some 0 < c < 1. That is, one 


need only move at fixed small multiple of |G|, passing from the location of the zero 
zg to the point z. Oo 
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The second half of the testing intervals inequalities is as follows. 


Lemma 9.6 For wu € {0,0’}, and any interval I, 


[rsar dw < w()). (128) 
I 


Proof For the sake of specificity, let u = o. Indeed, by Lemma 9.5, the same 
argument will work for 0’. To fix ideas, let us assume that J € K. Write the left, 
middle and right thirds of J as J_, Ip, 1, respectively. Then, note that 


; H, (1)? dw = / H, (1)? dw (129) 
T I_-,Uh 
< / Ho (lo)? dw + / Ho (Ip + 11)? dw + / Ho (1-1 + In)? dw 
I-\Uh IL-4 q 
(130) 
+ H,(I-1)* dw + i H,()* dw. (131) 
I-\ q 


The first term on the right is simple. On the interval Jp, o is a point mass, at a point 
that is at distance > c|/| from /4,. Thus, by (122), 


|e} 


[Z| 


In2 


[Z|m3 ~ o(Z). 


i Hy (Io)? dw < 
I-,Uh 


That completes the first integral. The remaining two integrals in (130) are handled 
by a similar argument. 

Concerning the two integrals in (131), one should note that J; € K and that 
o (41) < 372+?in3 o(1). This geometric factor is smaller than 5, therefore one can 
recurse on (130) and (131) to see that 


/ H,(K)* dw < a(K), KeK. (132) 
K 


For a general interval J, since o is a sum of Dirac masses, we can assume that the 
interval J is in a canonical form. Namely, each endpoint of J can be assumed to be 
an endpoint of an interval in G. The basic inequality is 


> [ita-~P dw So()), (133) 
K 


KEK, 


where K; is the maximal elements of K contained in J. The integration is over K, 
and the argument of the Hilbert transform is J — K. 
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To see that (133) implies the Lemma, note that by (132), 


H, (1)? dw = He (1)? dw 
T 


x 
y fe (I— K) dw+ > fw? dw 
eG KEK, 
So) + Yo o(K) So. 


KEK, 


In fact, (133) follows from 


o(I)” 
[2/7 


[ite (I—K)|? dw< w(K), Keéeky. (134) 


For this is summed over K € K;, and then one uses the A» property. 

To prove (134), all hope of cancellation is abandoned. For an interval K € K7, let 
us consider component J, of J — K which lies to the right of K. It has a Whitney like 
decomposition into a finite sequence of intervals J,,...,J, that we construct now. 
These intervals will have the property that they are (a) pairwise disjoint, (b) their 
union is J, (c) and dist(K, supp(oJ;)) = |Js| > 32|K|, for all 1 < s < t. 

Now, J; = K + |K| € G. If this interval is not contained in /, it follows that K 
contains the right hand endpoint of J, and there is nothing to prove. Assuming that 
J, C TI, the inductive step is this. Given J,..., Js, as above, whose union is not [it 


(1) If J; € G, then J, + |J,| € K. If this interval is contained in J, then we take 
J = Js + |J;| € K, and repeat the recursion. Otherwise, we update J, := 
In — (ee , J;, and the recursion stops. 

(2) If J; € K, then it follows that J;,_; € G, and the element of G immediately 
to the right of J; is 3(/; + 6|J;|). If this interval is contained in J, then we 
take Jsg.1 = 3s + 6|J;|) € G, and repeat the recursion. Otherwise, we update 
Js = In — Biase J,, and the recursion stops. 


With this construction, it follows that 


o(Js) _m2 ey) 

|Ho (In) * K| oy Ti suit Sy 

This proves the ‘right half’ of (134), that is, when the argument of the Hilbert 
transform is J. The ‘left half’ is the same, so the proof is complete. oO 


At this point, we have proven that the pair of weights (w,o’) satisfy the full 
Poisson Az condition, and the testing condition (128). But, ||Hw]|;2(,”) is infinite, 
by (124). Hence, points (1) and (2) of Theorem 9.1 are shown. 
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We have also shown that the pair of weights (w,o) satisfy the full Poisson 
Az condition, and both sets of testing conditions. Hence, by our main theorem, 
H,, is bounded from L?(w) to L?(c). This pair of weights also fail the pivotal 
condition (20) of Nazarov-Treil-Volberg [38]. This is verified by observing that the 
collection G of gaps is a partition of [0, 1], and 


y- Pw, GWG) = >> HO” Ai) 


2, 
GEG GEG IG| 
~ 5° w(3G) ~ SIG" = 00 
GEG GEG 


since G contains 2” intervals of length 3~”, for all integers n. Here, we have 
used (125), followed by (121). Since inf,eg Mw(x) 2 P(w, G), this also shows that 
the maximal function M is not bounded from L?(w) to L?(o). 

Notice in contrast that the energy inequality (19) for the partition G is trivial, 
since o restricted to any interval G is a point mass, hence E(o,G) = 0, for all 
GeG. 


Context and Discussion 


Counterexamples were an important source of inspiration on these questions. The 
early paper of Muckenhoupt and Wheeden [33] includes an example of the fact that 
the simple A» condition is not sufficient for the two weight inequality. For instance, 
the boundedness of the simple A2 ratio is simple to check for the pair w = do, and 
0 (dx) = x1 0,00)dx. Then, one sees that for f = iy, 


Vlogl x |Ifll- « logL = ||Hofllw, L>1. 


Thus, the Hilbert transform is unbounded. And, one can directly see that the half- 
Poisson A» condition fails. 

Much harder, is the fact that the Poisson Az condition is not sufficient. This was 
the contribution of Nazarov [34]. This example lead to the conjecture of Nazarov- 
Treil-Volberg [56] proved herein. A more delicate example, of a pair of weights 
which satisfied the Poisson Az condition, and one set of testing conditions, say (4), 
but not the norm inequality was that of Nazarov-Volberg [40]. Also see Nikol’skii- 
Treil [42], for a related example to disprove a conjecture about similarity to a normal 
operator. Both of these latter examples were based upon Nazarov’s indirect example. 

The example given here is directly inspired by a Cantor set type example in 
Sawyer’s two weight maximal function paper [52]. It is drawn from [20], with the 
purpose to show that the pivotal condition of Nazarov-Treil-Volberg [38, 56] was 
not necessary for the two weight inequality to hold. This was an explicit example, 
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and also pointed to the primary role of the notion of energy. It is very interesting 
and delicate, in that the point masses have to be placed on the zeros of the Hilbert 
transform, in order to obtain the boundedness of the transform. It is also humbling 
in that it still does not reveal how delicate the proof of the sufficiency in the main 
theorem needs to be. 

It is subtle example of Maria Carmen Reguera [47] and Reguera-Thiele [49] that 
proves this, as is pointed out by Reguera-Scurry [48]. 


Theorem G There is a pair of weights for which the maximal function M, is 
bounded from L?(a) — L?(w) and M,, is bounded from L?(w) > L?(c), but norm 
inequality for the Hilbert transform (1) does not hold. 


This is quite a bit more intricate than the examples we have presented. It had 
been suggested, in the early days of the weighted theory, that the boundedness 
of the maximal functions would be sufficient for the norm boundedness of the 
Hilbert transform. On the other hand, if one considers ‘off-diagonal’ estimates, then 
boundedness of the maximal function is sufficient for norm inequalities for singular 
integrals [8]. 


Applications of the Main Inequality 


The interest in the two weight problem stems from a range of potential applications 
arising in sophisticated arenas of complex function and spectral theory. The 
motivations for these questions are complicated, and based upon subtle theories. The 
connections to the two weight Hilbert transform are not always immediate, and the 
properties of interest are frequently more intricate than those of mere boundedness 
of a transform. Nevertheless, the acknowledged experts Belov-Mengestie-Seip in 
[4] write “...we have found it both useful and conceptually appealing to transform 
the subject into a study of the mapping properties of discrete Hilbert transforms. 
We have learned to appreciate that the essential difficulties thus seem to appear in 
a more succinct form.” A brief guide to the subjects, and some of the ‘essential 
difficulties’ follow. 


Sarason’s Question on Toeplitz Operators 


This question arose from Sarason’s work on exposed points of H! [50]. Indeed, 
this was part of an influential body of work that pointed to the distinguished role 
of de Branges spaces in the subject. This paper contains examples of pairs of 
functions f, g, for which the individual Toeplitz operators where unbounded, but 
the composition bounded. 
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Question 1 (Sarason [51]) Characterize those pairs of outer functions g,h € H 
for which the composition of Toeplitz operators T,T;, is bounded on H?. 


Following [51], for a function h € L?(T), the Toeplitz operator 7), can be thought 
of as taking f € H? to the space of analytic functions by the definition 


Tift) = 5 | fle yh(e" VEAP a, 


where k,,(z) = ee nae is the reproducing kernel. 
Also in [51] is an, argument of S. Treil that a Poisson Az condition is necessary 
condition for the boundedness of the composition: 


sup P\f|?(z)Plg|?(z) < 09, (135) 


zeD 


where P denotes the Poisson extension to the unit disk. Sarason wrote that ‘/t is 
tempting to conjecture that the last condition is also sufficient for the boundedness 
of T,T;,. This statement, widely referred to as the Sarason Conjecture, is of interest 
in both the Hardy and Bergman space settings.(Aleman-Pott-Reguera [2] have 
resolved the conjecture in the negative in a Bergman space setting. A striking 
argument in which they prove the boundedness of the Bergman projection is 
equivalent to the boundedness of the positive part of the Bergman projection. This 
allows a much simpler counterexample to be identified.) 

The connection with the two weight problem for the Hilbert transform is indi- 
cated by the diagram from [7] §5, see Fig. 10. In the diagram, M; is multiplication 
by / and P is the Riesz projection from L? to H*. The boundedness is equivalent 
to 


M,P+M; : H? + BH’. 
The structure of outer functions leads to these simplifications. Since the product 


of analytic is analytic, the second H* above can be replaced by L”, and then, the 
outside multiplication M, can then be replaced by M),|. Thus, we are considering 


Fig. 10 Sarason’s Question Tg T; 


concerns the top line of the H2 he H2 
diagram, which is equivalent 

to the lower part of the 

diagram. The operator M;, on 

the /eft is an isometry onto its M- Mg 


range, while M,, the operator h 
on the right is an isometry 
between the two spaces 
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M\_\P+M; : H? +> L’. Now, f is anti-analytic, so we can replace H? above by L?. 
Moreover, the multiplication operator My,jf| is unitary, since an outer function can 
be equal to zero on T only on a set of measure zero. Thus, it is equivalent to consider 


Mg\P4+My, 2 L?  L?. 


This is a two weight inequality for P+. (Sergei Treil helped us with the history of 
this question.) 

The Riesz projection is a linear combination of the identity and the Hilbert 
transform, and our main theorem will apply to it. Note that the inequality 


IP+ (AP) Mlr2dgeany S We llz2ca 


is equivalent to 


IP+UFPW)llzdgeay S Iv llezwra- 


Recall that P+ = J— poze , according to how we defined the Hilbert transform, where 
I represents the identity operator. In the two weight setting, we interpret the norm 
inequality ||P+(of)|lw S |lflle, as uniform over all truncations 0 < t < 1 defined by 


Pity Sore ; £Q) 2 
vs 


t<|x—y|<r—! yx 


Theorem 10.1 For pairs of weights w, o that absolutely continuous with respect to 
Lebesgue measure, the norm inequality ||P+(of)|lw < |[flle holds if and only if the 
pair of weights satisfy the Poisson A, condition (3), and these testing inequalities 
hold, uniformly over all intervals I, for a finite positive constant P, 


i IP.(o1))[? w(dx) < Po(0), / Pw)? o (dx) < Pw(0). 
va I 


One must be sure that the A» inequality is necessary from the norm inequality. As 
it suffices to test real-valued functions, the real-variable proof given here will suffice. 
This in particular shows that for the densities of the weights, o(x) - w(x) < Ab, for 
a.e.x. Thus, the identity part of the norm, and testing, inequalities are trivial. The 
remaining parts just concern the Hilbert transform, so one can use the main result. 

If one is interested in the Sarason question for functions f, g that are not outer, 
there is no simple reduction to the two weight inequality for the Hilbert transform, 
and the problem is quite subtle, as the role of the multiplier P+ M; is more involved 
than that of just a weight. 
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Model Spaces 


For a probability measure o on T, define a holomorphic function @ on D by the 
Poisson integral 


1 
1-62) =|, eae - 


This is an inner function: A holomorphic map of D to itself which is unimodular 
a.e.on T. Also, 6(0) = 1. (The measure o is a Clark measure for 6, frequently 
written as 01.) 

The shift operator Sf(z) = zf(z) on H? has invariant subspace 0H? = {Of : fe 
H?}, whence Kg := H? © 0H? is invariant for S*. Beurling’s theorem states that 
every invariant subspace for S* is of this form. The model operator is Sg := PoS, 
where Py is the orthogonal projection from H? onto Kg. Remarkably, subject to mild 
conditions, every contractive operator on a Hilbert space is unitarily equivalent to 
a properly chosen Sg. For this, and other reasons, properties of the Kg spaces have 
broad significance. 

The spaces Kg and L?(c) are unitarily equivalent, with the unitary map from 
f €LV’(c) to F € Ko given by 


F() = (1-82) i BO) 


One is interested in those measures jz on T for which the natural eae ag operator 
is bounded from Kg to L?(z), namely, is it the case that ||F'l|,, < ||Fllx,. We see that 
this bound is equivalent to 


ive es o(dt)| [1 — (2 uae) < If. 


That is, the question is equivalent to a two weight inequality for the Hilbert 
transform on T. 

From this perspective, one can lift counterexamples concerning the two weight 
Hilbert transform to those for embedding operators, which is the tactic of [40], 
from which we have taken this condensed presentation. A characterization of the 
embedding question can be read off from our main theorem. 

But note that Clark measure is on T, by definition, and the second measure 
4 is constrained to be supported on T, whereas the disk would be the natural 
assumption. In the case where jz is supported on the disk, and one seeks an isometric 
embedding, the question has a remarkable answer, found by Aleksandrov [1]. The 
general question is resolved in [24], which gives a characterization of a two weight 
inequality for the Cauchy transform, under these restrictions on the supports of the 
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weights. The method of the proof is similar to that of the Hilbert transform, with 
some additional complications. 

The model spaces are also important to spectral theory, and the subject of 
rank one perturbations of a unitary operator. In spectral theory, it is important to 
understand the structure of the unitary operator that sends the Hilbert space to 
into L? of the spectral measure. Weighted Hilbert transforms arise therein. See for 
instance [42], which uses the example of Nazarov showing that the A, condition is 
not sufficient for the boundedness of the Hilbert transform. Also see [30]. 

We point the interested readers to [41, 46], and the many citations therein for 
more information about these subjects. 


de Branges Spaces 


We recall the setting of [3, 4]. For a sequence of distinct points T = {y,} C C and 
a sequence of positive numbers v = {v,,} consider the Cauchy transform 


an Un 


Yn 


Airy) 1 a= {ays a 


niz#Yn 


This is well defined for a € io and z € , defined by 


Qefec: > Eon <h 


niz#Yn 


Call H(L, v) the space of functions analytic on Q given by the image of €? under 
Hr,v). For appropriate choices of (I, v), these Hilbert spaces have deep connections 
to analytic function spaces. For instance, the reproducing kernels of H(T, v) are 


Un 
k,(€) = —_——__——__, 22. 
O-- eposm = 


And, many natural questions, such as the structure of frames of reproducing kernels 
for H(T’, v), require knowledge about the two weight inequality for the Cauchy 
transform. For instance, the main real-variable result in [4] is a characterization of 
a two weight inequality, but under the requirement that both measures be a sum of 
point masses on sparse collections of points. This yields interesting results in the 
setting of de Branges spaces. 

The definition of H(I,v) provides just one possible representation of a de 
Branges space, a class of Hilbert spaces with remarkable properties. The standard 
reference for them is [9]. Beginning from the works of Sarason [50], they have 
become an essential part of subject of analytic function spaces. 
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Singular Integrals, Rank One Perturbations 
and Clark Model in General Situation 


Constanze Liaw and Sergei Treil 
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Abstract We start with considering rank one self-adjoint perturbations Ag = A + 
a(-,g~)g with cyclic vector gy € H ona separable Hilbert space 1. The spectral 
representation of the perturbed operator Ag is realized by a (unitary) operator of 
a special type: the Hilbert transform in the two-weight setting, the weights being 
spectral measures of the operators A and Aq. 

Similar results will be presented for unitary rank one perturbations of unitary 
operators, leading to singular integral operators on the circle. 

This motivates the study of abstract singular integral operators, in particular the 
regularization of such operator in very general settings. 

Further, starting with contractive rank one perturbations we present the Clark 
theory for arbitrary spectral measures (i.e. for arbitrary, possibly not inner character- 
istic functions). We present a description of the Clark operator and its adjoint in the 
general settings. Singular integral operators, in particular the so-called normalized 
Cauchy transform again plays a prominent role. 

Finally, we present a possible way to construct the Clark theory for dissipative 
rank one perturbations of self-adjoint operators. 
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Introduction 


Rank one perturbations play an important role in operator theory and mathematical 
physics. One of the principal attractions of rank one perturbations is that for such 
operators almost everything can be explicitly computed, and then the advanced 
technique of Harmonic Analysis, like the study of fine properties of Cauchy type 
integrals, or advanced theory of singular integral operators can be applied. 


Main Players 
Rank One Perturbations 


Self-adjoint rank one perturbations occurred naturally in mathematical physics 
[45]. For example, a change in the boundary condition of a limit-point half-line 
Schrédinger operator from Dirichlet to Neumann, or to mixed conditions, can be 
reformulated as adding a rank one perturbation (see for example [40]). 

The technique of rank one perturbations was used in some results on orthogonal 
polynomials and Jacobi matrices, and there are some interesting applications to free 
probability (see e.g. [9, 10]). They also turned out to be useful in the investigation of 
certain random Hamiltonian systems called Anderson models and the longstanding 
Anderson localization conjecture [8]. Many specializations of this conjecture were 
studied in literature and the field is still very active (see e.g. [2, 17, 19-21], also see 
[28, 46] for a recent account of parts of the field). Rank one perturbations play a role 
in [1, 27, 30, 41]. Recent studies of closely related unitary Anderson models as well 
as accessible explanations of the physical relevance of these models can be found, 
e.g. in [23—25, 42]. The additive perturbation is replaced by a multiplicative one and 
the dynamical localization behavior is known to be quite similar to its self-adjoint 
analogue. 


Singular Integral Operators 


Singular integrals is a classical and actively studied field in Harmonic Analysis, 
and rank one perturbations serves as a source of very interesting problems. Many 
results for rank one perturbations are obtained by investigating fine properties 
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of singular integrals. For example, investigation of the boundary behavior of the 
Cauchy transform of measures lead via the so-called Aronszajn—Krein formula, 
see (4), to the famous Aronszajn—Donoghue theorem stating that singular parts of 
the spectral measures of the family of rank one perturbation by a cyclic vector are 
mutually singular. 

As for a different example, basic facts about Cauchy transform of a measure 
coupled with the Aronszajn—Krein formula (4) give a proof of the famous Kato— 
Rosenblum theorem about preservation of the absolutely continuous spectrum for 
rank one (and automatically for finite rank) perturbations. While the proof for the 
trace class perturbations using the technique of wave operator is probably more 
elegant, the approach of singular integrals gives some helpful insights. 

A deep relation between singular integral operators and rank one perturbations 
is based on the fact that a unitary operator realizing the spectral representation of 
a rank one perturbation is given by a singular integral operator acting L?(ww) > 
L? (tq), where [1g is the spectral measure of the perturbed operator, see [32]; we 
explain this connection in the present notes. We should mention here that the spectral 
measures j4 and j1, can be extremely bad, without any reasonable smoothness, so 
the above operator gives a natural example of a two weight estimates for Cauchy 
type operators with extremely “pathological” measures. 


Clark Measures and Clark Model 


In the paper [12] that started what is now called the “Clark theory” D. Clark 
considered all unitary rank one perturbations of a special contractive operator 
(the so-called model operator with scalar inner characteristic function). He also 
described the spectral measures and the spectral representations of the perturbed 
unitary operators. 

The spectral measures of these unitary rank one perturbations were later called 
the Clark measures. Note, that if we fix one such rank one unitary perturbation, 
then the other unitary rank one perturbations are the rank one perturbations of the 
fixed one. In the original paper [12] all the spectral measures were purely singular, 
but very often the term Clark measures (or Clark family of measures) was used for 
spectral measures of unitary rank one perturbations of a unitary operator, or for the 
spectral measures of self-adjoint rank one perturbations of a self-adjoint operator. 

Later many deep function-theoretic results about Clark measures were proved by 
A. Aleksandrov, (see [3-7], or see [37] for a survey), so sometimes people refer to 
Aleksandrov—Clark theory, or Aleksandrov—Clark measures. Extremely significant 
contributions to the theory were made then by A. Poltoratskii, who, in particular, 
proved the a.e. existence with respect to the singular part of the measure of the non- 
tangential boundary values of the so-called normalized Cauchy transform, see [38]. 

We also mention an important book [39] by D. Sarason where many aspects 
of Clark theory were treated from the point of view of function space theory. In 
particular, a description of the Clark operator was obtained in the case when the 
characteristic function 6 is an extreme point of the unit ball in the Hardy space H%. 
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Within classical analysis many fruitful connections of Clark measures with 
holomorphic composition operators, rigid functions and the Nehari interpolation 
problem have been discovered and studied, see for example, [37]. Some problems 
in the theory of Hardy spaces, and more generally of other spaces of analytic 
functions are closely related to Clark theory. Thus, recently M. Jury [26] computed 
the asymptotic symbols of a certain class of weakly asymptotic Toeplitz operators 
in terms of the Aleksandrov—Clark measures which occur in the context of rank one 
perturbations. 


Plan of the Notes 


These lecture notes give an account of the mini-course delivered by the authors, 
which was centered around [31-33]. Unpublished results are restricted to the last 
part of this manuscript. 


Self-Adjoint and Unitary Rank One Perturbations 


We begin section “Self-Adjoint and Unitary Rank One Perturbations” with an 
introduction of self-adjoint rank one perturbations. We then find a unitary operator 
V, giving the spectral representation of the perturbed operator, see Theorem 2.1 
below; this operator looks like a singular integral operator with Cauchy type kernel 
(s—t)~!, although the formula of the operator looks quite different from the classical 
singular integral operators of Cauchy type. 

In particular, the so-called Rigidity Theorem, see Theorem 2.2 below, holds 
for such operators: it essentially says that if the formula from Theorem 2.1 gives 
a bounded operator with trivial kernel, then, after probably a renormalization 
(multiplication by a non-vanishing weight) of the measure in the target space, we 
get exactly the unitary operator from the perturbation theory, given by Theorem 2.1. 

We then give a different representation of the operator V, that looks more in 
line with the traditional formulas for singular integral operators. Regularizations 
of singular kernels, treated later in section “Singular Integral Operators”, play an 
important role in getting this alternative representation. 

We then present similar results for the unitary rank one perturbations of unitary 
operators. Everything works out similarly to the self-adjoint case; some formulas for 
the unitary case might not look as transparent as the ones in the self-adjoint case, but 
in the unitary case we avoid technical difficulties related to dealing with unbounded 
operators. 


Regularizations of Singular Integral Operators 


Section “Singular Integral Operators” is devoted to the theory of regularization of 
singular kernels, which we believe have applications far beyond the perturbation 
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theory. We show that under very general assumptions about a singular kernel, its so- 
called restricted boundedness implies the uniform boundedness of all “reasonable” 
regularizations of the corresponding formal singular integral operator. 

The restricted boundedness of the kernel is the weakest boundedness property 
of the corresponding singular integral operator. Usually, it is assumed in the theory 
of singular integral operators that a singular kernel K blows up on the diagonal 
x = y, so the formal integral representation Tf(x) = [ K(x, y)f(y)dj(y) is not well 
defined. 

However, even if we only start out with a kernel K (without assuming the we are 
given an operator) for bounded functions f and g with separated compact supports 
the expression 


(Tf, g) = / K(x. yf e@)dp(y)dv(x) 


is well defined, and if the “correct” estimate |(Tf, g)| < C\lf|| wes 1/p+ 


1/p’ = 1 holds for all such pairs, we say that K is L?(jz) > L?(v) restrictedly 
bounded. And we show in section “Singular Integral Operators” that if the measures 
jz and v do not have common atoms and the kernel K is restrictedly bounded, then 
for any “reasonable” regularization K, of the kernel the corresponding regularized 
operators T; are uniformly (in ¢) bounded. This result gives us a way to define for 
each restrictedly bounded kernel a corresponding singular integral operator. 


yplisl 


Clark Model for Contractive Perturbations of Unitary Operators 


Section “Clark Theory for Rank One Perturbations of Unitary Operators” is devoted 
to the Clark theory in full generality. We start with unitary rank one perturbations of 
a unitary operator U by a *-cyclic vector. All such perturbations can be parametrized 
by a scalar parameter y € T; if one takes y € D the resulting operator will 
be a completely non-unitary (c.n.u.) contraction with defect indices 1-1. For such 
a contraction a so-called functional model, cf. [43] can be constructed; in fact 
functional models are the canonical way of investigating non-normal contractions. 

Thus, the perturbed operator U,,, y € D has two unitarily equivalent representa- 
tions: one in the spectral representation of U and the other one in the model space 
for the functional model. The Clark operator is a unitary operator intertwining these 
representations. In Clark’s original paper [12] this operator was constructed for the 
case of the operator U having purely singular spectrum. In [12] the starting point was 
a c.n.u. contraction with inner characteristic function, which — after translation to 
our language — means that the unitary operator U (and thus all its rank one unitary 
perturbations U,,) has a purely singular spectral measure. 

In the general case (general spectral measure, or equivalently, a general scalar 
characteristic function) our approach of starting with perturbations of unitary 
operators looks more natural; in particular, it allowed us to describe the Clark 
operator. Of course, now when we know all the formulas, it is possible to go in the 
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opposite direction and start with a c.n.u. contraction; but using this approach without 
knowing the formulas in advance we would have a hard time getting the results. It 
could well be just our personal preference, but deducing the formulas in our setup 
starting from a unitary operator was a natural and a straightforward process. 

The main problem with the general case of Clark theory is that for general scalar 
characteristic function the model is vector-valued, i.e. the model space consists of 
vector-valued functions (with values in C7), Earlier approaches based on function 
spaces theory, see for example [39], dealt with spaces of scalar-valued functions. 
Some parts of the Clark operator were obtained using such methods, but for the 
full operator one had to honestly write down a complete model space and do all the 
computations. 

The adjoint of the Clark operator is described using singular integral operators of 
Cauchy type. The so-called normalized Cauchy transform investigated by A. Poltor- 
atskii, see [38], plays a prominent role there. The Clark operator itself then can be 
represented via boundary values of the analytic functions. 

In the model theory we adapt the point of view of coordinate-free model by 
N. Nikolski and V. Vasyunin, cf. [35, 36], where by picking different spectral 
representations of the minimal unitary dilation one gets different transcriptions 
of the model. We present a “universal” representation formula, valid in any 
transcription, as well as formulas adapted to two popular transcriptions, the Sz.- 
Nagy—Foias transcription and the de Branges—Rovnyak one. 


Clark Model for Dissipative Perturbations of Self-Adjoint Operators 


The last part, section “Few Remarks About Clark Theory for the Dissipative Case” 
is devoted to the Clark model for the dissipative perturbations of a self-adjoint 
operator. We adapt a common approach that the model space for a dissipative 
operator is the model space of its Cayley transform (which is a contraction), with one 
detail: since the original operator lives in L7(R, jx), we, using the standard conformal 
map between the upper half-plane C+ and the unit disc D, move the model space to 
the real line (half-plane). The results in this section were not presented before. 

Note, that the formulas in this section do not look as elegant as in the case of 
perturbations of unitary operators. Probably, a different approach to the model of 
dissipative operators would be more appropriate, but we do not know a serious 
contender yet. As a pure speculation, the de Branges spaces £(y) could serve 
as appropriate model spaces for the dissipative perturbations. These spaces were 
introduced in the first chapter of [13], but were not much investigated, unlike the 
spaces H(E) which were investigated in details in [13] and were subject of extensive 
research by many authors. 
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Self-Adjoint and Unitary Rank One Perturbations 


Self-Adjoint Rank One Perturbations 


For a self-adjoint (possibly unbounded) operator A on a separable Hilbert space H. 
let us consider the family of rank-one perturbations A,, a € R, given by 


Ag :=A+aQ(-,9)4/P on H. (1) 


Here, if the operator A is bounded, then ¢ is a vector in H. For unbounded A, we 
can consider the wider class of “singular form-bounded” perturbations where we 
assume gy € H_\(A) D H, where H_,(A), r € N, is the completion of H with 
respect to the norm || - llay_-cay? IF lay cay = ||7+ |Al)~"/7Fl4,- In particular, the 
perturbation a(-,g)gy can be unbounded (see [29, 32] and the references within 
for further details). If # = L7(IR, 2) and A = M, is the multiplication by the 
independent variable, 


Mf (t) = f(t), VteR, 
then H_| (A) is exactly the collection of measurable functions such that 


Fa? 
rR 1+ |e 


du(t) < oo. 


For r > 2 the formal expression (1) does not uniquely determine a self-adjoint 
operator: For fixed a there is a family of self-adjoint operators corresponding to (1). 
For this reason we do not consider this case, but rather assume that r < 2. 

Without loss of generality we can assume that ¢ is cyclic for A, that is, 


H = clos span{(A — AI)~'g: A € C\ R}. 


Otherwise, i.e. if H = clos span{(A—AI)~“'@ : A € C\R} CH, then we restrict our 
attention to the action on 1 as the perturbation is trivial (does nothing) on H © H. 

Then according to the Spectral Theorem the operator A is unitarily equivalent 
to the multiplication M, by the independent variable in a space L?(1) where ju is 
a spectral measure of the operator A. Spectral measure is of course not unique, 
multiplying a spectral measure by a non-vanishing weight (i.e. by a function w € 
Line(H), W > 0 [-a.e.) we get a different spectral measure. 

It is customary in the operator theory and mathematical physics to consider the 
canonical spectral measure to be the spectral measure associated with the “vector” 
¢, i.e. the unique measure jz such that 


d 
Fa) = (A= anm'p.e) = | FO. 
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In this case g is represented by the function 1, and the assumption that g € H_,(A) 
means simply that /,(1 + |x|)~'du(x) < 1. 

Knowing function F one can say a lot about the spectral measure ju: since 
the imaginary part of F is (up to the factor zr) the Poisson integral of 44 we can 
immediately conclude that the density of the absolutely continuous part of jz is given 
by the non-tangential boundary values of 27! Im F(z) (such values exist a.e. by 
classical results). It is also not hard to show that the singular part of jz is supported 
on a set where the (non-tangential) boundary values of Im F are infinite. 

In the heart of the theory of rank one perturbations lies the simple fact that there 
is a simple relation between the function F and the corresponding functions Fy, for 
the perturbed operators. 

Namely, the following simple formula for the inverse of the rank one perturbation 
of the identity is well known 


(1- (6a) 14 5( a: 2) 


here d = 1 — (b,a) is the so-called perturbation determinant, and the operator is 
invertible if and only if d 4 0. The proof of this formula is an easy exercise, we 
leave it to the reader. 

Using the above formula (2) one can easily compute the resolvent of the 
perturbed operator Ay, 


a ((A — ADs, 9) 


Gg AAI) =F hoo) 


(A-—AD'p (3) 


which immediately implies the relation between the function F' and the corre- 
sponding functions Fy, Fy(A) := ((Ag — AI)~'¢g,@), commonly known as the 
Aronszajn—Krein formula: 


F 
Fy = : 
l+aF 


(4) 


If {44 denotes the spectral measure of the perturbed operator Ay associated with 9, 
then 


dt 
Fu(a) = f Be 


(Note that it is not hard to show that if g is cyclic for A, then g is also cyclic for Ag 
and therefore A, is unitarily equivalent to the multiplication operator M, in L?(j1,)). 
Many classical results in perturbation theory can be obtained from the 
Aronszajn—Krein formula (4) and classical results about boundary values of the 
Cauchy transform. 
For example, it is not hard to show that all the absolutely continuous parts of 
the measures [ly are equivalent (i.e. mutually absolutely continuous), which is just 
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the Kato—Rosenblum theorem for rank one perturbations. Also, the analysis of the 
singular parts of the measures /Z, yields the famous Aronszajn—Donoghue theorem, 
stating that the singular parts of {1g are mutually singular. 


Rank One Perturbations and Singular Integral Operators 


We find a sufficient condition on the absence of singular spectrum by studying the 
spectral representation, which comes in the form of a two weight Hilbert transform. 
Part of this material can be understood as a first example for section “Singular 
Integral Operators”. 

Consider a family of rank one perturbations given by Ay := A+a(-, ¢)g, see (1), 
where y € H_,(A) is cyclic for A. Let jz denote the spectral measure of operator 
A with respect to g, so A is unitarily equivalent to the multiplication operator M, 
in L?(j). Let us consider the operator A in its spectral representation, i.e. let us 
assume that A is the multiplication operator M, in L*(1). As we discussed before, 
the assumption that y € H_,(A) means simply that f,(1 + |x|)~'du(x) < oo, 

Then the operator A, is defined by 


Ag =A+ a, 9)9 =M, + a(-,151 
on L?(j1). On the other hand, the operator Ay is unitarily equivalent to the 
multiplication M, by the independent variable s in L7(1,) (we use a different letter 
for the independent variable here to distinguish between the multiplication operators 
in L*(1) and L*(j1~)). 

We want to find a unitary operator giving the spectral representation of the 
operator Ag, i.e. a unitary operator 


Vu? L’(p) > L? (Ma) 
such that 
Vy Ag = M,V,. 


We also want ¢ to be represented by 1 in both representations, which translates to 
additional condition V,g = 1. 

Theorem 2.1 of [32] gives the representation of V, as the Hilbert transform type 
singular integral. 


Theorem 2.1 (Representation Theorem) Under the above assumptions the spec- 
tral representation Vy : L?(1) > L? (a) of Aq is given by 


fer —fO 


Vaf(s) = f(s) — dt) (5) 


for all compactly supported C' functions f. 
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Without going into the details of the proof, we indicate the proof strategy for 
bounded operators A. The intertwining condition 


MsVq = VyAa = Va(M; + a(- ’ P)~) (6) 
can be rewritten as 
VM, = Ms Vu _ a(- ; ~)VaQ.- 


Using induction we get 


n—1 


VeM" = MV, — a S-(-, Mig) MV 9. 
k=0 


Recalling that ¢ = 1, Vag = 1, we get by applying the above identity to g and 
denoting f,,(t) := t” we get 


n—1| 
Votn(s) = s” — a | (x a) du(t). 
R \k=0 


Summing the geometric progression under the integral we get the representation 
formula (5) for f = far, fr(t) = t”. Linearity of (5) implies that it holds for all 
polynomials, and rather standard approximation reasoning allows to extend this 
formula to the case of compactly supported C! functions. 

This reasoning, of course, works only for bounded operators A (i.e. when 
the measure jz is compactly supported). In the case of unbounded operators the 
resolvent identity (3) is used instead of (6), see [32] for the details. 

Aside we mention that integral operators represented by formula (5) are very 
interesting objects, probably deserving more careful investigation. Without proof 
we mention one property (see Theorem 2.2 of [32]), which can be understood as a 
converse to the last Representation Theorem. 


Theorem 2.2 (Rigidity Theorem) Let measure on R be supported on at least 
two distinct points and satisfy [(1 + |t\)~!du(t) < oo. Let V be defined on 
compactly supported C' functions f by formula (5). 

Assume V extends to a bounded operator from L?(w) to L?(v). Assume 
Ker V = {0}. 

Then there exists a function h such that 1/h € L®(v), and M,V is a unitary 
operator from L?() — L?(v) (equivalently, that V : L?(dy) — L?(\h|? dv) is 
unitary). 

Moreover, the unitary operator U := M,V gives the spectral representation of 
the operator Ay := M, + a(-,)¢, ¢ = 1, in L?(w), namely UAy = M,U, where 
M, is the multiplication by the independent variable s in L?(v). 
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The integral in the representation formula looks like a singular integral operator, 
but not exactly in the traditional sense. The attempt to understand the precise con- 
nection with the theory of classical singular integral operators lead us to the theory 
of regularizations. We describe these results in more detail in section “Singular 
Integral Operators” on general abstract singular integral operators. 

But now, let us first notice that 


= a0 ff EO aun anes 


(Vaf', 8) 12640) = 
for all f € L?() and g € L?(f1,) with separated compact supports. This equality 
is trivial for compactly supported C! function f and g (with separated compact 
supports) and can be extended to the general case by a standard approximation 
argument. 

Since V, : L?(4) > L? (fq) is further unitary, the kernel K(s,#) = 1/(s — f) is 
what we call restrictedly bounded kernel, see Definition 3.1 below. 

An application of Theorem 3.2 and Remark 3.3 shows the following result. 


Theorem 2.3 For the measures [L, [ly as above, the operators T, : L?() > 
L*(Ha), 


fd 


Rs—tt+ie 


T.f(s) -= du(t), 


are uniformly (in €) bounded. 


Uniform boundedness of the operators T; implies that there exists a w.o.t. limit 
point of T,, as ¢ > 0. In fact, it can be shown that this limit point is unique if ¢ > 0* 
or € — 0°, so we can Say that there exist a w.o.t.-limits 74 = w.o.t.- lim,_,9+ Ts. 

The existence of w.o.t. limits follows, for example, from the lemma below the 
fact that for Imz > 0 and for Imz < 0 the non-tangential boundary values of 


Rf (2) := Jr ae exist [q-a.e. 


Lemma 2.4 For any f € L*() the non-tangential boundary values of Rf 1(z) = 


He z€ Cy orze C_ exist Uy-a.e. 


Proof The a.e. convergence with respect to Lebesgue measure (and so with respect 
to the absolutely continuous part of {1_) follows from classical facts about boundary 
values of functions from Hardy spaces: for f > 0 the function Rfj(z) has non- 
positive imaginary part, so composing it with a conformal mapping from the lower 
half-plane C_ we get a bounded analytic function, which has non-tangential limits 
on R a.e. with respect to Lebesgue measure. Representing arbitrary complex-valued 
function as linear combination of 4 non-negative ones we get the a.e. existence (with 
respect to Lebesgue measure) in general case. 

To prove the convergence with respect to the singular part (1); of Ua we get by 
applying functional ¢ to the resolvent formula (3) and denoting fy = Vaf that 
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Rf 
Rfa a = ——— . 
1+ aR 
By Poltoratkii’s theorem, see [38, Theorem 2.7], the non-tangential boundary values 
of Rfya/R[La exist (and coincide with fy) ({4~)s-a.e. Combining the above identity 
with the Aronszajn—Krein formula (4) we get 


Rfu bea = Rf (7) 
Riba Ru 


But it follows from the Aronszajn—Krein formula (4) that the non-tangential 
boundary values of F = Ry exist (and equal to —1/a) (q)s-a.e. Indeed 


ImF Im F 
nS ee 

and Im Fy, function is the Poisson extension (up to the factor zr) of the measure [y. 
Therefore, since the singular part of the measure //q is supported on the subset of 
R where non-tangential boundary values of Im F, equal +-oo, we can conclude that 
the non-tangential boundary values of F equal —1/a ({tq)s5-a.e. 

Since the non-tangential boundary values in (7) exist ({q)s-a.e., we conclude that 
same for Rf 2. oO 


The above Lemma 2.4 implies the w.o.t. convergence of T, as ¢ > 0* or é > 
O-. Indeed, Lemma 2.4 implies the fy-a.e. convergence, which, in turn implies 
that any weakly convergent subsequence of T,f converges to the same function (the 
a.e. limit). And this, as one can easily see, means that T,f has a weak limit as e > 0+ 
ore> 0. 

So, we can define the operators T+ either as w.o.t. limits of T, as e — O* or 
define Tif as the non-tangential boundary values of Rf(z), z € Cx. 

Using the operators T+ we obtain an alternative representation formula, see 
Theorem 3.2 of [32]: 


Theorem 2.5 Let pt and [Ly be the spectral measures of A and Ag, and let T+ be as 
defined above. 
Then V,, can be written as 


Vof(s) =f(s)\Q-aTsI)+aTsf, Wf elu). (8) 


Proof Consider operators Vé : L?(w) > L?((a), 


f(s) —fO 


s—tt+ie 


Vi (s) = f(s) — du(t) = f(s) —aT.1(s)) + aT f(s) 
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and notice that for compactly supported C! function f the functions Vé f(s) converge 

uniformly and in L?(j1q) to Vp as € > 0. Together with uniform bounds on 7, this 

immediately implies that V’ converges in the strong operator topology to Vy. 
Taking w.o.t. limits we arrive to the representations (8). Oo 


Remark Note that for the existence of the w.o.t. limits of T, it is sufficient to have 
[4q-a.e. convergence on a dense set. As we just discussed above, for compactly 
supported f € C! the functions Vf converge uniformly to Vy. It was also 
shown in the proof of Lemma 2.4 that 7,1(s) = —F(s + ie) converges [y-a.e., 
which immediately implies j4,-a.e. convergence of T.f for compactly supported C! 
functions. 

This approach was used in [32]. 


Unitary Rank One Perturbations 


In this section we present the analogues of the Representation Theorem 2.1 and 
the Rigidity Theorem 2.2 for the case of unitary rank one perturbations of unitary 
operators, that were proved in [31, Section 8]. 

We should mention, that these results cannot be obtained just by taking the 
Cayley transform of the self-adjoint case, we will explain this in section “Few 
Remarks About Clark Theory for the Dissipative Case”. 

In the contrast with the self-adjoint case the description of all unitary rank one 
perturbations of a unitary operator is not immediately self-evident, but with a little 
effort one could see that all unitary rank one perturbations of a unitary operator U 
can be parametrized as 


Ura =Ut(a—1)(-,U*b)b = bEH, |b) = 1, weT. (9) 


The fact that this formula indeed gives us the parametrization of the unitary rank one 
perturbations can be easily seen in the case U = I; the general case then is obtained 
by right multiplying the formula for the perturbation of I by U. 

In what follows we assume that the vector b is fixed and use the notation U, for 
Up, SO our perturbations will be parametrized by the scalar parameter a € T := 
{zeEC: |z| = 1}. 

Since the action of perturbation (- , U*b)b is trivial (zero) on (span{U"b : n € 
Z)+, we can ignore what is going on there and assume without loss of generality 
that b is *-cyclic vector for U, meaning that span{U"b : n € Z} = H. 

Then by the Spectral Theorem U is unitarily equivalent to the multiplication by 
the independent variable € in L?(jz) = L?(T, j4), where ju is a spectral measure of U. 
As in the self-adjoint case we fix a spectral measure jz to be the spectral measure 
corresponding to the vector b, so yz is a probability measure and the vector b in the 
spectral representation is given by the function 1. 
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So, as before let us assume that U is not just unitarily equivalent, but is a 
multiplication operator M; by the independent variable € in L*(u) = L*(T, 1), 
A(T) = 1 and the rank unitary perturbations U, are given by (9) with b = 1. 

It is not hard to show that if b is *-cyclic for U then it is also *-cyclic for 
Uy = Up, $0 Uy is unitarily equivalent to the multiplication M, by the independent 
variable z in L?(j1y). We take for jg the spectral measure corresponding to the 
vector b, so b = 1 in the spectral representation of U, in L?(1q). 

Under these assumptions we want to describe the unitary operator giving the 
unitary equivalence between U,, and its spectral representation of, i.e. the unitary 
operator V, : L?() > L?(,) such that V,1 = 1 and 


VyUy = MVy- (10) 


In Theorem 8.1 of [31] we proved: 


Theorem 2.6 (Representation Theorem) Let V, : L?(j1) > L?({q) be a unitary 
operator satisfying (10) and such that Vy1 = 1 (which means that [Ly is the spectral 
measure of Uy corresponding to the cyclic vector b, b(€) = 1). Then 


w [Oar f@) 


Va (z) = f(z) + A = du(é) for alfecgt). (dy 


Proof The proof goes similarly to the proof of the self-adjoint case (Theorem 2.1 
above) for the bounded perturbations sketched above. Namely, using “linear alge- 
bra” notation, i.e. identifying b € H with the operator b : C > H, b(a) = ab and 
denoting by b* its adjoint b* : H. —> C, b*(x) = (x,b),, we can write 


Uy = U + (a — 1)bby = Mz + (a — 1)bd7, 
where b; := U*b. Then the intertwining relationship (10) gives us 
VyU = MzVy + (1 — &)(Vab)d5. (12) 
Inductively one can show that for n > 0 
VU" = M'V_ + (1 — 0) ¥ ME" (Vyb) ((U*)" bi)" 
k=1 
Applying this formula to the function b = 1 € L*(j2) and recalling that (U"b)(€) = 


E" Vob = 1, bi (E) = &, (Ut)"*b, = &"-**! we obtain summing the geometric 
series 


ies 


(Vi£")(2) = 2" + (1a) i = ule) (13) 
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The action of V, on E "n= Ois proved similarly. Namely, taking the adjoint of the 
intertwining formula V,U, = M-,V, and right and left multiplying by V, we get 
that VyU* = MzV,y, so 


VyU* = My + 1 — &)(Veb1)b*. 


But that is exactly the intertwining relationship (12) with U* = U7! instead of U 
and Mz = M,-: instead of M,. So applying the same reasoning as above we get 
that (13) holds also for n < 0, and therefore for all trigonometric polynomials. 

A standard approximation argument concludes the proof. Oo 


A converse of the Representation Theorem is also true in the unitary setting. 
Under mild conditions bounded injective operators V : L?() — L?(v) that are 
given by (11) induce a Clark family. More precisely, we quote Theorem 8.4 of [31]. 


Theorem 2.7 (Rigidity Theorem) Let a probability measure tz on T be supported 
on at least two distinct points. Let a € T\ {1}, and let Vf be defined for C! functions 
f by the right hand side of (5). 

Assume V extends to a bounded operator from L?(w) to L?(v) and assume 
Ker V = {0}. 

Then there exists a function h such that 1/h € L®(v), and My is a unitary 
operator from L?(w) — L?(v) (equivalently, that V : L?(du) > L?(\h|? dv) is 
unitary). 

Moreover, the measure |h|*v is exactly the Clark measure [lq defined as above, 
and V treated as the operator L?() — L?({tq) is exactly the operator Vy from 
Theorem 2.6. 


As in the self-adjoint setting, the Representation Theorem reminds us of singular 
integral operators. Acting as in the self-adjoint case we show that the kernel 
K(z,€) = 1/(1 — &z) is restrictedly bounded (see Definition 3.1 below). Again, 
Theorem 3.2 and Remark 3.3 show the uniform boundedness of the regularization 
of the singular integral operator. 


Theorem 2.8 For the Clark measures | and |ty, the operators T, : L?(&) > 
1?(j1a) given by 


f(§)du) 


Tf (2) = 1_ rE 


are uniformly (in r € Ry \ {1}) bounded. 


An analog of Lemma 2.4 holds for the unit circle with essentially the same proof 
(for a different proof, see [31, Proposition 8.2]), so the limits lim,_, ;+ 7,.f(z) exist 
[Lq-a.e. on T. So we can define operators T+ as the [1-a.e. limits 


Taf (z) := ee T,f (2), zeT, 
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or, equivalently, as w.o.t. limits 


Taf = w.ot- lim Tf. 


r>1t 


Replacing the kernel in (11) by 1/(1 — r&z) and taking the limit as r > 1*, we 
get an alternative formula for V,. 


Theorem 2.9 Let tt and [ly be the spectral measures of U and Uy respectively, and 
let T+ = w.o.t.-lim,_,,+ T, (the existence of the limit was just discussed). Then Vy 
has the alternative representation 


Vef = [1-(1-a)TzIf+(l-a)Tsf Vf € L(y). 


How Unstable Can the Singular Spectrum Become? 


By the Kato-Rosenblum theorem we know that the absolutely continuous spectrum 
remains invariant under rank one perturbations. But under a rank one perturbation 
by a cyclic vector, the singular perturbation can change type, as was shown by an 
example by Donoghue. So the question becomes: To which extent may the spectral 
properties of the measures {4g vary as we change a? Much work has been done and 
many interesting examples were discovered, several are included in [40]. 

First of all notice that in the context of rank one perturbations for pure point 
and the singular continuous spectrum can behave quite different. For example, it is 
possible for Ay to have purely singular continuous spectrum on the interval [0, 1] 
for all w. But the same behavior is not possible for pure point spectrum. In fact, the 
perturbations A, have pure point spectrum for all a if and only if the spectrum is 
countable without accumulation points. 

Another question concerns the type of parameter sets that allow dense singular 
embedded (in absolutely continuous) spectrum. For several years, all examples 
exhibited dense singular embedded spectrum only for a Lebesgue measure zero set 
of parameters a. It came as a surprise when Del Rio, Fuentes and Poltoratskii [15] 
proved the existence of a family of rank one perturbations with dense absolutely 
continuous spectrum and dense singular spectrum for almost every parameter q@ in 
an arbitrary (previously given) set B C R and with purely absolutely continuous 
spectrum for almost every a € R\B. Their proof uses Clark theory. Via a 
complicated construction they show the existence of a characteristic function for 
which the corresponding family of rank one unitary perturbations has the desired 
properties. In fact, it is possible to produce most any type of singular spectrum in 
this setting, see [16]. In the last reference, it was remarked that replacing the words 
‘almost every’ by ‘every’ in their statement would be a non-trivial improvement, 
requiring a rather different approach. Namely, they suggested the following open 
problem: Fix an interval J C R and a measurable subset B C R. Can one find a 
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family of measures j1g so that dense singular spectrum on J occurs precisely when 
B € B (and the corresponding operator has purely absolutely continuous spectrum 
on / for all B € R\B)? 

A class of examples is concerned with the question of how unstable the spectral 
type may be, if we do not have absolutely continuous part. A result of Del Rio, 
Makarov and Simon [17] which was independently proved by Gordon [22] states 
the following. Consider J C supp yz closed and not a singleton. If wa, is singular, 
then the set of w’s for which j,|; is purely singular continuous is a dense Gs set. 

A converse to this result was presented by C. Sundberg [44]: For any closed 
subinterval J which is not a singleton and any Gs subset of R, there exists a family of 
measures (corresponding to a family of rank one perturbations) such that supp w C 
I, [ty is purely singular continuous for a € G and [ly is pure point for aw € R\G. In 
the proof, Sundberg applies Clark theory. He constructs the characteristic function 
by defining a function on a Riemann surface 7 over the disk ID, and then applies the 
projection from R to 


Behavior of the Singular Continuous Spectrum 


To this day, a characterization of the singular continuous part of the perturbed 
operator’s spectral measure in terms of the unperturbed operator remains an open 
problem. Several sufficient conditions for the absence of singular continuous 
spectrum are known (see, for example, [11, 40]). Within the realm of our methods, 
an application of Theorem 2.3 empowers us with control over singular spectrum of 
the perturbed operator. 


Lemma 2.10 (Lemma 4.4 of [32]) Operators Ay, a € R \ {0}, have a pure 
absolutely continuous spectrum on a closed interval I, if 


é€ 
i x wh de = oO, 
0 


Here du = wdx+ dy, (w € L' (dx)) is the Lebesgue decomposition, and wy denotes 
the increasing rearrangement of w on I. 


This result allows a construction of unperturbed operators A with arbitrary embed- 
ded singular spectrum and for which all of the perturbed operators Ay, a # 0 have 
no embedded singular spectrum. 
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Singular Integral Operators 
Preliminaries 


The Hilbert transform T 


f(y)dy 


R*—y 


Tf (x) = 


is an example of what is usually called a singular integral operator. “Singular” here 
means that the kernel K(x, y) of the operator is not integrable in y near the diagonal, 
so in the formal expression Tf(x) = [ K(x, y)f(y)dy the integral is not well defined. 

In the case of Hilbert transform it is very easy to show that the integral in the 
sense of principal value is well defined for C! compactly supported functions, so 
the operator is defined on a dense set in L? (and L?, 1 < p < ov). It also can be 
shown that it can be extended to a bounded operator there. 

Moreover, it can be shown that the integral in the sense of principal value exists 
a.e. for all f € L?, 1 < p < ox; the proof is not as easy as for the C! functions, and 
is, in fact, quite involved. 

A part of the operator V, from Theorem 2.1 looks like the Hilbert transform, with 
the difference that the integration there is with respect to a general Radon measure 
p. And what makes things even more complicated, is that the target space is L?(1,) 
with {4g being a new measure. 

In the theory of singular integral operators, there are several ways to define 
such an operator rigorously. One of the accepted ways, is what one would call the 
axiomatic approach. Namely, to define a singular integer operator T : L?(u) > 
L?(v) with kernel K we assume that we are given its bilinear form, defined on a 
dense subset of L’() — L?’(v), 1/p + 1/p' = 1. The fact that 7 is an integral 
operator with kernel K means simply that 


(Tt) = / K(x. yf) g@du()dv(x) (14) 


for all (say bounded) f and g with separated compact supports. Since the kernel K 
blows up only on the diagonal x = y, the integral above is well defined. Note, that 
according to this definition the multiplication operator M,, Myf = gf is an operator 
with kernel K(x, y) = 0. 

Moreover, it can be shown that any bounded singular integral operator with 
kernel K = 0, where kernel is understood in the sense of (14), is a multiplication 
operator. So, according to the axiomatic approach, any two bounded singular 
integral operators that differ by a multiplication operator are identified as equal. 

Another way to define the singular integral operator with kernel K is to consider 
the truncated operators T,, 
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Tef (x) = / _ Kes snfoyay 
x—y|>e 


which under usual assumptions about kernel K are well defined for bounded 
functions f with compact support. And we say that the integral operator with kernel 
K is bounded if all operators 7; are uniformly bounded. If the operators T, are 
uniformly bounded, we can take w.o.t. limit of T; as ¢ > 0*, so in this case K is 
indeed a kernel of a bounded singular integral operator in the sense of the axiomatic 
approach. 

Moreover, in all known examples if an axiomatically defined operator T is 
uniformly bounded then the operators 7; are uniformly bounded. And as it turns 
out, this is not a coincidence, but a corollary of a very general fact. 


Setup 


In this paper we assume that jz and v are Radon measures on R¢ and that K belongs 
to Li,.(u x v) off the diagonal x = y, meaning that for any x9 # yo there exists a 
neighborhood G of (x9, yo) € IR? x R¢ such that K 1,¢€ L?(u x v). Note, that these 
assumptions are weaker than what is usually assumed about the kernels of singular 
integral operators. 

The main results are also true for (at least some) locally compact abelian groups, 
in particular for tori T“. Also, since everything is local, the results can be modified 
to hold on smooth manifolds. 


Definition 3.1 Let K € L?,.(u x v) off the diagonal x = y. We say that K is 
I?(u) — LP(v) restrictedly bounded if for all f ¢ L©(w), g € L®(v) with 


separated compact supports 


dees (15) 


 [ Keonroretawordv09] < Cll, hel 
The best constant C in (15) is called the L’(4) — L?(v) restricted bound of K, and 


denoted by KT GPO)" 


If the exponent p and the measures ju, v are fixed, we will skip (4) > L?(v) 
and simply say restrictedly bounded. 


Going back, we can see that the operator V, from Theorem 2.1 is a singular 
integral operator (in the sense of axiomatic approach) with kernel K(s,t) = 
a/(s — t). Since V,, is a unitary operator L?(j1) > L?({1q) its norm is 1 and therefore 
the kernel a /(s—s) is restrictedly bounded with the L?(j1) > L(j1q) restricted norm 
at most 1. Equivalently, one can say that the L?(j1) > L?(j1q) restricted norm of the 
kernel 1/(s — f) is at most 1/|a]. 
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Similarly, the operator V, from Theorem 2.6 is a singular integral operator with 
kernel K(z,&) = (1 — a)/(1 — &z), z,€ € T, and the L?() > L?(1q) restricted 
norm of the kernel 1/(1 — &z) is at most 1/|1 — aq]. 


Regularizations of Singular Kernels 


Let m : R¢ — R be a regularizer, i.e. a bounded function which is 0 in a 
neighborhood of 0 and 1 in a neighborhood of oo. Define the regularized kernel 
K, by K.(x,y) = K(x, y)m((x— y)/e). The regularized kernels K, are in L7,.(( x v) 
so the regularized integral operators T,, 


rf) = f KG. O)dwO) 


are well defined for bounded compactly supported f. In particular, if mx) = 
1 Ah:ee} (|x|) then we get the classical truncation 


rf) = [ Kenrordno. (16) 


If for 1 < p < oo operators T, : L?(jz) — L?(v) are uniformly bounded, then 
by taking w.o.t. limit point as e > 0* we conclude that K is a kernel of a singular 
integral operator (in the sense of the axiomatic approach) with kernel K, acting 
TP (pt) > P(r). 

It turns out that the converse statement is true, even in a stronger sense, if 
we assume that the measures yz and v do not have common atoms. Namely, the 
following theorem holds, see [33, Proposition 2.12]. 


Theorem 3.2 Let a kernel K be L?(t1) > L?(v) restrictedly bounded, and assume 
that 4 and v do not have common atoms. Then for any regularizer m € C°® 
the regularized operators T, are uniformly (in &) bounded, ITelloqy—se 
C(m) < oo. 


w = 


Moreover, for all “interesting” kernels the L?(u) — L?(v) restricted bounded- 
ness implies the uniform boundedness of the classical truncations (16). 

Without going into details, we just mention that the “interesting” kernels include 
kernel 1/(x — y), x,y € R of the Hilbert transform, the kernel (x — y)/|x — y|*T!, 
a>d0,x,ye R¢ of the generalized Riesz transform R, in R¢, the kernel 1 /(z—w), 
z,w € C of the Cauchy transform, the kernel 1/(z— w)?, z, w € C of the Beurling— 
Ahlfors transform and many others. For more information please refer to [33]. 

Regularizations with smooth functions m seem to be a more logical and 
convenient choice, than the classical one; for example if one starts with a Calderén— 
Zygmund kernel then after smooth regularizations the resulting kernel will still be 
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a Calderén—Zygmund one with uniform estimates of the constants. However, the 
classical truncations are used most. 


Remark To define truncation of a kernel on the unit circle T we take the function m 
on the line, m = 0 in a neighborhood of 0 and m = | in a neighborhood of oo, and 
define functions m, on T by 


m,(e") = m(t/e), —m <t<7. 
Then the regularized kernel K, is defined as 


K,(z, €) = K(z, €)me(z/), Zé€eT. 


The regularized kernels on T? are defined similarly, and the same results as in R¢ 
holds in T¢. 


Remark 3.3 For singular integrals related to complex analysis there is another type 
of natural regularization. Namely for the kernel K(x, y) = 1/(« — y) on R one can 
consider kernels 


Kze(x,y) = 1/(~—y tie). (17) 
Similarly, for the kernel K(z, £) = 1/(1 — &z) on T define the regularized kernel 
K,(z, €) = 1/(1 — réz), O<r<o rFi. (18) 


For these kernels Theorem 3.2 holds as well. 


Now let us discuss the main ideas of the proofs. 


First Step: Schur Multipliers 


The first idea is very simple: we want to multiply a restrictedly bounded kernel by a 
function M such that the resulting kernel is still restrictedly bounded. 


Definition 3.4 We call a function M(-, -) an L?(tz) > L?(v) Schur multiplier if for 
any L? (jz) — L?(v) restrictedly bounded kernel K the kernel KM is also L?(“) > 
L?(v) restrictedly bounded and 


T < T 
KM pu) ~ CD oy sve)’ 


The best constant C in the above inequality is called the Schur norm of M. 


Traditionally, Schur multipliers are defined with respect to the operator norm of 
the corresponding integral operators, or with respect to the Schatten—von-Neumann 
norm, but our definition is very close in spirit, so we use the same term. 
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Formally, our definition depends on jz, v and p, but we will construct “universal” 
multipliers, that work for all 4, v and p with the same estimate on the Schur norm. 
They also are Schur multipliers with respect to the operator norm, as well as with 
respect to the Schatten—von-Neumann norms. 

Thus, in what follows we will omit 1?(u) — L?(v) and simply say Schur 
multiplier. 


Constructing Schur Multipliers via Fourier Transform 


We start with an elementary observation: the function M,, Ma(x, y) := e laxeiay | 
a,x,y € R¢ is a Schur multiplier with the Schur norm 1 (as a product of two 
unimodular functions of one variable). 

Averaging in a we get that if o is a complex-valued measure of bounded variation 
and m = @ is its Fourier transform, 


G(s) := - e "da (t) 


then the function M(x, y) = m(x — y) is a Schur multiplier with the Schur norm at 
most varo. 

Note also that for m,(s) = m(s/e) and the measure o; defined by o,(E) = o(eE) 
we have m, = G;. Since var o, = varo we get that all the functions M, 


M(x, y) = me(x — y) = m((x— y)/e) 


are Schur multipliers with the Schur norm estimated by varo. 

Since a compactly supported C® function is a Fourier transform of an L! 
function (it is a Fourier transform of a Schwartz class function), and 1 is trivially a 
Schur multiplier, we can conclude that functions M,, M.(x, y) = m((x—y)/e) where 
mis the C™ regularizer defined in section “Regularizations of Singular Kernels” are 
Schur multipliers. 

So we see that the regularized kernels K, obtained using smooth regularizers m 
are restrictedly bounded with the uniform (in ¢) estimate on the restricted norm. 

To get the corresponding result for the torus T’ we just need to restrict the 
regularizers m, to the cube (—z, 2] and then map the cube to the torus via the 
standard map. 


Cauchy Type Regularizations 


Let us now discuss the Cauchy type regularizations (17) and (18). For p(x) = 


Vio.coy® * define 
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a s 
m(s) = 1— p(s) = =e 
s-i 
Then m,(s) = m(s/e) = s/(s—ie), and the functions M,(x,y) = m,(x — y) are 
Schur multipliers with Schur norm at most 2. Computing the regularized kernel we 
get 


1 x-y 1 


K, ? = = ’ 
&y) x-—yx-y-ie x—(yt+ie) 


so the kernels K, from (17) are uniformly restrictedly bounded. 

Repeating the same reasoning with p(x) = 1 e* we get the conclusion 
for K_,. 

For the kernel (18) on T we use the Fourier transform on Z. Namely, it is easy to 
show that if a € €'(Z) and m(z) := Y“ypez a,.z*, z € T, then the function M 


M(z,§) =m(z/§) zz, € © T 


is a Schur multiplier with Schur bound at most ||a lla ; 
Then for 0 < r < 1 multiplying K(z, ) = 1/(1 — &z) by 


(—oo,0] 


= — _ pl EB\n _ 1-& 
m(z/&) = Pe )(Ex)" = 2 


we at most double the restricted norm (because 1 + }°°2, |r?—r""!| = 14+r <2). 
So, for the kernel 


1-& 1 
K(z, &) =. a eed 


we get forr < 1 


Kelis = ls): a) 


For r > 1 we can write 


m(z/&) = 


a = 1-)'¢ ny @+)) (Ez) a 


a4 n=1 


Noticing that 1 + }°°2, |r-" — r-@+)| = 14 r7! < 2 we see that in the case 
r > 1 (19) holds as well. 
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Final Step: Boundedness of the Regularized Operators 


Theorem 3.5 Let js and v be Radon measures in R4 without common atoms. 
Assume that a kernel K € L},.(je x v) is L?(u) > L?(v) restrictedly bounded, 
with the restricted norm C. Then the integral operator with T kernel K is a bounded 
operator L? (tt) — L?(v) with the norm at most 2C. 


Restricting the kernels to compact subsets exhausting R¢ x R¢ one can easily 
reduce the proof to the case K € L?( x v) (globally, not locally). Then the idea of 
the proof is very simple. Taking bounded compactly supported functions f and g we 
can write 


(fay = / K(x. yf O)g)duG)dv(). 


The main idea of the proof is to construct bounded functions f,,, g, with separated 
compact supports such that f, — 5 f weakly in L?(W), &n — 5 g weakly in L?(v) and 
such that 


lim su ne SOO . lim su a 20 
im sup [fall zrquy < IF lle ey im sup [nll Gy = Isl ray (20) 


Since the operator T is Hilbert-Schmidt, and so compact (as an operator L?(j1) > 
L?(v)) the weak convergence implies that 


1 
(Ths Bn)v = a Sv: 
Therefore, using (20) we get 


\(2F.8)| < limsup 4|(1f,.80)1 S 2CUf lary Bly, 


noo 


The main idea of the construction of the functions f,, and g, is quite simple, at 
least for the absolutely continuous piece: we define f, := 1, »8n *= 1, where 
E,, and F;, are separated “mesh like” subsets, that are well mixed, meaning “that for 
all dyadic cubes Q of size at least 2~” the Lebesgue measure of the sets OM E, 
and Q 1 F,, is almost half (with relative error of say 2~”) of the measure of Q. 
Construction of such sets in for the Lebesgue measure is rather trivial and can be 
left as an exercise for the reader. 

For the measures j2 and v without atoms the construction is almost the same, 
only the “well mixed” property is with respect to the measure 0 = jz + v, meaning 
that for any dyadic cube Q of size at least 2~” the measures 0(Q MN E,), 0(Q0 MN 
F,) are almost half of o(Q) with relative error 2~”. It might not be immediately 
obvious how to construct such sets E,, F,, but the construction is relatively simple 
and straightforward, see [33] for details. 
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The construction in the general case is just a bit more complicated. Namely, we 
first construct the sets E,, and F,, with respect to the continuous parts [Z,, Vv. of the 
measures (making sure that the sets do not contain any atoms). Then we define f, 
and g, by adding to f1 E, and g1 F, the functions 


1 n 1 n 
5 flab» 5 2) 8K) Sr, 
k=1 k=1 


respectively, where ax, by, are atoms of jz and v respectively. To make sure that the 
functions f,, and g, have separated supports, we then just need to “shrink” the sets 
E,, F, by removing small discs around atoms. Again, the reader is referred to [33] 
for the details. 

This idea of using “well mixed” set was exploited in [34] in the case of Lebesgue 
measure. It was later used in [32], where some of the result in this section were 
proved under the assumption that the singular parts of jz and v are mutually singular. 

The results in full generality were proved in [33], the reader should look there 
for full details. 


Clark Theory for Rank One Perturbations of Unitary 
Operators 


Plan of the Game 


As we discussed above in section “Unitary Rank One Perturbations”, rank one 
unitary perturbations of a unitary operator U are parametrized by the formula (9). 
If in (9) we take |a| < 1 (instead of |a| = 1) the resulting operator Uy will be not a 
unitary, but only a contractive (||Uq|| < 1) operator. 

If, as in section “Unitary Rank One Perturbations” we assume by ignoring the 
trivial part that b is *-cyclic vector for U, then for |y| < 1 the operator U, = 
U + (y — 1)bbF, b) = U*b is a completely non-unitary (c.n.u.) contraction. The 
term completely non-unitary means that there is no reducing (i.e. invariant for U, 
and Uy ) subspace on which U,, acts unitarily. 

A completely non-unitary contraction T is up to unitary equivalence determined 
by its so-called characteristic function 0 = 6,,, see the definition below. Namely, T 
is unitarily equivalent to its model M = Mg, where Mg is a compression of the 
multiplication operator M,, 


Mo = PeM, Lor 
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here Kg is a subspace of a generally vector-valued, and possibly weighted L? space 
on the unit circle, Pg = Px, is the orthogonal projection onto Kg, and M, is the 
multiplication by the independent variable z, M_f(z) = zf(z),z € T. 

So, we have two unitarily equivalent representations of the operator Uy, |y| < 1: 
the representation 


Uy =Me+(y-lbby, = b= 1, b, = MFA 


in the spectral representation of U in L?(w), where jz is the spectral measure of U 
corresponding to the vector b, and the representation as the model operator Mg, in 
the model subspace Kg, . 

The Clark theory describes the unitary operator providing this unitary equiva- 
lence, i.e. a unitary operator ®, : Kg, > L? (jt) such that 


®,Mo, = Uy ®,. 


D. Clark in his original paper [12] described such operators for the particular 
case when @, is an inner function. He started with the model operator (unitarily 
equivalent to U,, |y| < 1 in our notation) in a particular case of inner characteristic 
function, described all its unitary rank one perturbations (U,, |w| = 1 in our 
notation) and described the unitary operator between the model operator Mg and 
the spectral representation of Uy, |a| = 1. 

Translated to our language the fact that the characteristic function @ is inner 
means that the operator U (and so all Uy, |w| = 1) have purely singular spectrum. 


A Functional Model for a c.n.u. Contraction 


Let us recall the definition related to the functional model. For a c.n.u. contraction 
T acting in a separable Hilbert space we define the defect operators 


D,:= A-T*T)'”, Dj». = A-TT"*)"”, 
and the defect subspaces 
D = Dr := clos Ran Dr, D, = Dx := clos Ran Dy«. 
The characteristic function 6 = 67 of the operator T is an analytic function 0 = 


Or € AS, whose values are bounded operators (in fact, contractions) acting from 
®D to D, defined by the equation 


6r(2) = (-T + Dr+(I— cT*)~'Dr) 5° zeD. (21) 
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Note that TD C D,, so for z € D the above expression indeed can be interpreted as 
an operator from D to Dx. 

It is customary to assume that the characteristic function is defined up to constant 
unitary factors on the right and on the left, i.e. one considers the whole equivalence 
class consisting of functions U@V, where U : D, — E, and V: E — 9D are unitary 
operators and E,., E are Hilbert spaces of appropriate dimensions. The advantage of 
this point of view is that we are not restricted to using the defect spaces of T, but 
can work with arbitrary Hilbert spaces of appropriate dimensions. 

Note, that the characteristic function (defined up to constant unitary factors) is a 
unitary invariant of a completely non-unitary contraction: any two such contractions 
with the same characteristic function are unitarily equivalent. 

Note also, that given a characteristic function, any representative gives us a 
model, and there is a standard unitary equivalence between the model for different 
representatives. 


Remark Another way to look at a choice of a representative of a characteristic 
function is to pick orthonormal bases in the defect spaces and treat the characteristic 
function as a matrix-valued function (possibly of infinite size). The choice of the 
orthonormal bases is equivalent to the choice of the constant unitary factors. 


In this paper by a functional model associated to an operator-valued function 
6 © HP... we understand the following: a model space Kg is an appropriately 
constructed subspace of a (possibly) weighted space L?(E, @E, W) on the unit circle 
T with the operator-valued weight W. The model operator Mg is a compression of 
the multiplication operator M, onto Ko, 


Mo = PoM, Ko’ (22) 


where Pg = Px, is the orthogonal projection onto Kg. 

All the functional models for the same 6 are unitarily equivalent, so sometimes 
people interpret them as different transcriptions of one object. 

As we already mentioned above, a completely non-unitary contraction with 
characteristic function 6 is unitarily equivalent to its model Mg. 

On the other hand, for any purely contractive 6 € L: ae » ]@lloo < 1 the model 
operator Mg is a completely non-unitary contraction, with 6 being its characteristic 
function. Thus, any such @ is a characteristic function of a completely non-unitary 
contraction. 


Sz.-Nagy—Foias Transcription 
The Sz.-Nagy—Foias model (transcription) is probably the most used. 


The model space Ky is defined as a subspace of L*(E, ® E) (non-weighted, 
WZ) =D, 
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Hi: 0\ 1» 
-_ He H 2 
i ( aastts )0 (A) e 22) 


where the defect A is given by 


A(z) = (1 — 0(z)*0(z))!?, zeT. (24) 


If the characteristic function 6 is inner, meaning that its boundary values are 
isometries a.e. on T, then A = 0, so the lower “floor” of Kg collapses and we 
get a simpler, “one-story” model subspace, 


Ko = H’(Ex) © OH’ (E). 


This subspace is probably much more familiar to analysts, especially when 6 is a 
scalar-valued function. 

The model operator Mg, is defined by (22) as the compression of the multi- 
plication operator M, (also known as forward shift operator) onto Kg, and the 
multiplication operator M, is understood as the entry-wise multiplication by the 


independent variable z, 
m,(%)=(*%). 
“\h zh 


As we discussed above, the characteristic function 0 is defined up to constant 
unitary factors on the right and on the left. But one has to be a bit careful here, 
because if 6(z) = U6(z)V, where U and V are constant unitary operators, then the 
spaces Ky and Kj are different. 


However, the map U 
4 Ug 
Uu — 
(5) = (5) 


is the canonical unitary map transferring the model from one space to the other. 

Namely, it is easy to see that U/ is a unitary map from H*(Ex) ® clos AL?(E) 
onto H?(UEs) ® clos AL?(V*E), where A = Az = V*AV. Moreover, it is not 
difficult to see that UK» = Kz and that U/ commutes with the multiplication by z, 
so Ug : =U | Ka intertwines the model operators, 


UpMeg = Mylo. 
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de Branges—Rovnyak Transcription 


Let us present this transcription as it is described in [35]. Since the ambient space in 
this transcription is a weighted L” space with an operator-valued weight, let us recall 
that if W is an operator-valued weight on the circle, i.e. a function whose values are 
self-adjoint non-negative operators in a Hilbert space E, then the norm in the space 
L?(W) is defined as 


|dz| 
lao = [ were.se), 5. 
T ue 
There are some delicate details here in defining the above integral if we allow the 
values W(z) to be unbounded operators, but we will not discuss it here. In our case 
when the characteristic function is scalar-valued the values W(z) are bounded self- 
adjoint operators on C’, and the definition of the integral is straightforward. 
Let 
I 6(z) 
Wo(z) = : 
O=(gar't) 


The weight in the ambient space will be given by W = we wh = 
(Wo(z))-!| where AI!) stands for the Moore—Penrose inverse of the operator A. 
If A = A* then Al! is O on KerA and is equal to the left inverse of A on Ran A. 
The model space Kg is defined as 


— 1({) : 84 € H(Ex), g- € H2(E), g-—0*g4 © AL*(E)y. (25) 


Remark 4.1 The original de Branges-Rovnyak model was initially described in 
[14] using completely different terms. To give the definition from [14] we need 
to recall the notion of a Toeplitz operator. For g € Lz... the Toeplitz operator 
T, : H?(E) > H’(Ex) with symbol ¢ is defined by 


Tof = P+(gf), f € H°(E). 
The (preliminary) space H(0) C H7?(E,) is defined as a range (I — 


Ty T9*)'/2H?(E) endowed with the range norm (the minimal norm of the preimage). 
Let the involution operator J on L?(T) be defined as 


If(2) = F@.- 


Following de Branges—Rovnyak [14] define the model space D(@) as the set of 
vectors 


114 C. Liaw and S. Treil 


) :g1 € H(O), go € H°(E), such that z"g, — OP4(z"Jgo) € H(O) Vn = 0, 
§2 
and such that 
IC) 
§2 


It might look surprising, but it was proved in [36] that the operator (<*) a 


2 


:= lim (Iiz"s1 — OP + (z"Jg)|I2 


+ IIPx(2'Jg2)I13) < 00. 
D6) noo H(0) 


) is a unitary operator between the described above model space Kg in the de 
8- 
Branges—Rovnyak transcription and the model space D(6). 


Model for the Operator U, 


For the perturbations U,, |y| < 1 the functional model can be computed explicitly. 
The defect operators are computed to be 


Du, = (I- Cf) = (1- ly?) bib, 
Dy,« = (I-U,U,*)"” = (1=|y/?)"? bb" 
and the defect spaces are 


D= Du, = span{b;} and D,. = D5 « = span{b}. 
‘ i 


Note that the defect spaces are one-dimensional, so the characteristic function 0 
6,, is a scalar-valued function. We already mentioned above that 6 € H™, ||6||c0 
1. Note also that the defect spaces do not depend on y. 

The characteristic function 6, of U, can be computed in terms of Cauchy type 
transforms. For a (possibly complex-valued) measure t on T and A ¢ T define the 
Cauchy type transforms R, R; and Ry by 


Rey = [| SO. Rit) = [ee Rot (A) = [| Sare) 


1—EA 
(26) 


lA Il 


If we pick b, and b to be the basis vectors in the corresponding defect spaces, then 
the characteristic function 6, of the operator U,, |y| < 1 is given by 
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Q=lyPRinA) _ A=y)RouQ)-(1+y) 
1+(1-y)RipA) (-y)RowA)+0 +7)" 


6,(A) =-y + 


(27) 


Note that the formulas for 6) (y = 0) are especially simple. And 6 is related to 0, 
by a fractional transformation: 


by +7 


Oo —y 
1+ 76," 


¥ 1-76 


or equivalently A (28) 


To compute the characteristic function one can use the definition (21) of the 
characteristic function with U, instead of T and the inversion formula (2). Namely, 
writing 


I—2U% = (1—zU*) (I — cy — 1) (1 — 2U*) "5, b*) 


and applying the inversion formula (2) we get denoting 6 = y — 1 


1 
(Ba ~ 2U*)-'by, b) 


(—2Uy)' = | 1+ zB(I —zU*) ‘bb* | (—zu*) |. 


In the spectral representation of U in L?(w) the operator (J — zU*)~! is the 
multiplication by the function 1/(1 — &z), b = 1, b\(E) = &, so the above inverse 
can be explicitly computed. Then standard algebraic manipulations lead to the 
formulas (27) for the resolvent. 

A different way of computing the characteristic function for finite rank perturba- 
tions can be found in [18]. 

We point out that if the measure ju is purely singular (with respect to the Lebesgue 
measure), then the functions 6, are inner (|6,| = 1 a.e. on T). In this case the model 
is especially simple, the model space consists of scalar functions, and that is the case 
treated by the original Clark theory. 

However, in our case, jz is an arbitrary probability measure, so the characteristic 
functions can be non inner, and the model is more complicated: the model space 
consists of vector-valued functions (with values in C7). 


Preliminaries About Clark Operator 


Recall that our goal is to describe a Clark operator, i.e. a unitary operator (non- 
uniqueness is discussed in the next paragraph) that realizes unitary equivalence 
between U, and Mg,. Namely, we want to find a unitary operator ®, : Ke, > 
L?(2) such that 


©, Mo, = Uy ®y. (29) 
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Let us discuss what freedom do we have in choosing such an operator. Clearly, 
®, maps defect spaces of Mg, to the corresponding defect spaces of U,. Therefore, 
Oyb and Dy, must be unit vectors in D Ms and D Moy respectively. 

y 
We say that the unit vectors c € D mx and cy; € D Mo, agree if there exists a 
Y 


unitary map ®, : Kg, > L’() satisfying (29) such that 
yb = Cc, Did =C\. 


If y = 0 and p is the Lebesgue measure, then it is not hard to see that 0, = 0. It 
is also easy to see that in this case, any two unit vectors c € D Me andc; € D Mo, 
agree. 

Otherwise, if either y ~ 0 or yw differs from the Lebesgue measure, then for any 
unit vector c € D Mis there exist a unique vector c} € D Mo, which agrees with 


c; for details see Proposition 2.9 of [31]. That means the operator ©, is unique up 
to a multiplicative unimodular constant a € T; in particular, if we fix a unit vector 
cEeD Ms then the condition ®,c = b uniquely determines the Clark operator ®,,. 

In the trivial case when jz is the normalized Lebesgue measure and y = 0 
the Clark operator ®,, can be easily constructed via elementary means, so in what 
follows we will ignore this case. 


A “Universal” Representation Formula for the Adjoint 
of the Clark Operator 


An explicit computation of the defect spaces of the compressed shift operator Mg 
yields that in the Sz.-Nagy—Foias transcription 


Dus = span{c}, DM, = span{cy}, 


where 
= (1—Jaoy2)7!2 (1 = 9@e@ 
e(z) := (1 |())’) (Fare). (30) 
- ~! (6(z) — A(0 
extyem (leary (2 COO), ee 


where A := (1 — |6|?)!/?. 
Moreover, the vectors c and c; are of unit length and agree. Note also that it 
follows from (27) that 6, (0) = —y, so the above formulas can be further simplified. 
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The following theorem describes the adjoint ., of the Clark operator. Note that 
the intertwining relation (29) can be rewritten as 


@)U, _ Mo, ®). 


Theorem 4.2 (A “universal” representation formula; Theorem 3.1 of [31]) Let 
0, be a characteristic function (one representative) of Uy, |y| < 1, and let Ko, and 
My = Me, be the model subspace and the model operator respectively. Assume 
that the unit vectors c = c’ € Dui» c=c é€ D Me, agree. Let By : L?(u) > 
Ko, be the unitary operator satisfying 


®*U, = Mo, 2%, 


and such that yb =c’, Dy, = ae 
Then for all f <¢ C!(T) 


HF) =A, +8,(0) [OZ anes 2) 


where A,(z) = c’(z), By(z) = ec” (z) — zc} (2). 


Idea of the proof To some extent we mirror the proof of Theorem 2.6. However, 
several miracles occur (beyond the fact that we are now dealing with the vector- 
valued setting of the model space makes the computations are more cumbersome): 

Again, we begin with the intertwining relation ® U, = Ma, ®; and evaluate 
the projection of the model operator 


Me, = M, — zc} (ct)* — 4 (O)e" (ct)* = M, + (ye —zcf)(cq)*. 33) 
We notice that the model operator Mg, on Kg is a rank one perturbation of the 
unitary M-, and the operator Uy on L?(j1) is a rank one perturbation of the unitary 
U; (multiplication by the independent variable). So we expect that the commutator 


® U,- M,®; is at most of rank 2. But in fact, it turns out to be of rank one! 
Indeed, the intertwining relation oy U, = Me, ® can be rewritten as 


DIU, + (y — Me’bt = M,®) + (ye” — ch bt 
(here we used that ®*b = c” and (c})* ®* = (®,c})* = bf), and therefore 


DU, = M.®* + (c” — zc} ) by. (34) 
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From here, we proceed in analogy to the proof of Theorem 2.6 to obtain a formula 
for @5e". 
The formula for Dre ” cannot be computed by simply taking the formal adjoint 


of the commutation relation (34). This is due to the fact that in general zc] ¢ Ko. 
Instead we compute the adjoint of the model operator in analogy to (33) 


M5, = Mz — Mgc’ (c’)* — 0(0)c} (c’)* = Mz + (Yet — Mzc’)(c’)*. 
We find ourselves in the lucky situation that the formulas for ®)§” and ®Y &" turn 


out to be the same. oO 


In the Sz.-Nagy—Foias transcription we derive concrete formulas. For y = 0 we 
have (0) = 0 and by (28) we obtain 6,(0) = —y. With this, the vector-valued 
functions A, (z) and B,(z) in the universal representation formula (32) evaluate to 


—J4,)2y1/2 
= 14+ 76,(z ad ae 

Ay(2) = c"(@) = (l= ly?) el af a -( Thos > (35) 
4 TVA] 


1+ (¥—1)6,(z) - ’) 
(7 1)A, (2) aes 


= (“ — |v?) — (2))/0 — Ga 


i= 6-2 O=0= ph" ( 


(Y — I) Ao@)/I1 — YF)! 


where A, = (1 —|6,|7)!/”. 


Singular Integral Operators and a Representation 
for ®} in the Sz.-Nagy—Foias Transcription 


In this section we get a representation of oy adapted to the Sz.-Nagy—Foias 
transcription, similar to the representations given in Theorem 2.5. 

We first note that for v(§) = |B,(&)|? the kernel K(z,&) = 1/(1 — &z) is an 
L?() — L?(v) restrictedly bounded kernel, see Definition 3.1. Indeed, taking C! 
functions f and g with separated compact supports we get that 


(BE). 8), |dz| 
(,f.9) = / EO 
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and standard approximation reasoning extend this formula to all bounded functions 
with separated supports. But that means that the vector-valued kernel! B,(z)/(1 —€z) 
is a kernel of a singular integral operator L?(j1) — L? with norm 1, and so it is 
L? (1) + L? restrictedly bounded (with restricted norm at most 1). 

A standard renormalization argument then implies the L?(jz) > L?(v) restricted 
boundedness of the scalar kernel 1/(1 — &z). 

Therefore, as we discussed in section “Singular Integral Operators”, see The- 
orem 3.2 and Remark 3.3, the regularized operators 7, with kernel K,(z,&) = 
1/(1 — r€z) are uniformly bounded operators L?(j1) — L?(v), so the operators 
B,T, are uniformly bounded L?(j1) > L?. 

On the other hand, the boundary values of the Cauchy transform R (defined 
in (26)) exist a.e. with respect to Lebesgue measure by the classical theory of Hardy 
spaces; it is easier than for the operators V,, since we do not need a.e. convergence 
with respect to a singular measure here. 

In combination with the uniform bounds we can see the existence of weak 
operator topology limit 


T+ (= w.o.t.- lim T,. 


r>1T 


Note also that T+ can be defined as a.e. limits, 7+ f = lim,_, + T,f. 


Theorem 4.3 Operator -,, can be represented in the Sz.-Nagy—Foias transcrip- 
tion as 


1—lyl2)'2@* -( 0 ) Ca 
eS Y—-(V—DT+DA, a (y —1)Ay Tet 


(seme san) (y HE 
= 1-76 ft+ at Ty, T+f 
fi=yat 2 +1 Ao (7 — GER Ao 


forf € 1?(u). 


As expected this formula reduces to the normalized Cauchy transform for y = 0 
and inner functions @. To see this, we notice that the second component collapses as 
A(z) = (1 — |6(z)|*)!/? = 0 Lebesgue a.e. on T, and that T,f/T4+1 is equal to the 
normalized Cauchy transform. 


Idea of the proof For smooth functions f we replace the term 1 — Ez in the 
denominator of (32) by 1 — r€z and take the limit as r > 17. We obtain the same 
formula (32). Since we also have weak convergence of the operators we have 


'We did not discuss singular integral operators with vector-valued kernels, but the extension of the 
theory presented in section “Singular Integral Operators” to the case of kernels with values in R¢ 
or C¢ is trivial and we omit it. 
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[Sa —f@) 


(TQ) —fOT+D@) = du(é), zetT. 


We extend the operator by continuity to all of L7(j1) and derive 
T+1 = 1/1 — &) (37) 


from (26) through (28). Technical computations then yield the desired formula. O 


Interestingly, similar arguments show that 


T_1 = —4/(1 — 4). (38) 


Representing ®* in the de Branges—Rovnyak Transcription 


We translate the formula in the last Theorem 4.3 from the Sz.-Nagy—Foias transcrip- 
tion to the de Branges—Rovnyak transcription, rather than starting from the universal 
representation formula in Theorem 4.2. This strategy seemed less cumbersome as 
we circumvent having to re-do much of the subtle work of regularizing singular 
integral operators. Also, we found it refreshing to understand the connection 
between the transcriptions. 

By virtue of the definition of the Sz.-Nagy—Foias model space Kg, see (23), a 


function 
81 He 
— E 
. Gs & i 


g— := 0g, + Ag € H2 := L7(T) OB’. (39) 


is in Kg if and only if 


Note, that knowing g; and g_ one can restore gz on T: 
goA =g — 810. 


The equality (39) means that the pair g, = g; and g_ belongs to the de Branges— 
Rovnyak space, see (25). It is also not hard to check that the norm of the pair (gi, g—) 
in the Branges—Rovnyak space (i.e. in the weighted space L7(W), W = wel, see 
subsection “de Branges—Rovnyak Transcription”) coincides with the norm of the 
pair (g1, 2) in the Sz.-Nagy—Foiag space (i.e. in non-weighted L”). Indeed, we have 


(eG): 
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Let B be a “Borel support” of A, i.e. the set where one of the representative from 
the equivalence class of A is different from 0. A direct computation shows that for 


16 
w=(55) 
16) en fio) (10 
(on)™"(ea)=(or,)> 


which gives the desired equality of the norms. (Here 1, denotes the characteristic 
function of B.) 

Note that functions in H? admit analytic continuation to the exterior of the unit 
disc, so a function in Kg is determined by the boundary values of two functions g; 
and g_ analytic in D and ext(D) respectively. 

Since the first component of a function in the Branges—Rovnyak space is the 
same as in the Sz.-Nagy—Foias space and by virtue of Theorem 4.3 we immediately 
know 


we have a.e. on T 


(—lyP)"? Tf 
1—y% T41 


8+(Z) = ai(2)) = (40) 


where 7. was defined in the paragraph prior to Theorem 4.3. 
The second component g_ = g” is analytic on ext(D). Therefore, we do need to 
return to the universal representation formula. After some reformulation we observe. 


Theorem 4.4 (Theorem 5.5 of [31]) Let pt be not the Lebesgue measure. Then the 
function g_ = g” is given by 


(5 ,-\T- (l—|y))/760 Tf 
m=O) i? +7) y= ipo, tol’ 
2 =o z 


(41) 


Formulas for ®, 


A representation of the Clark operator ®, is given in terms of the components 
g+ and g_ of a vector in the de Branges—Rovnyak transcription. This formula is 
given piecewise. For a function f € L*(j1) we denote by f, and f, its “absolutely 
continuous” and “singular” parts, respectively. Formally, f, and f, can be defined as 
Radon-Nikodym derivatives f; = d(f[4)s/dis, fa = A(f )a/d[ea- 

Let w denote the density of the absolutely continuous part of du, ie. w = 
du/dx € L'. 
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Theorem 4.5 Let g = a € Ko, (in the de Branges—Rovnyak transcription) 
ge 
and let f € L*(u), f = ®yg. Then 


(1) the non-tangential boundary values of the function 


ead 
1 G-ppst®: zeD 
exist and coincide with f, j1s-a.e. on T. 
(2) for the “absolutely continuous” part f, of f 


1—7O% 1— 6 


(i= 22 wr = = = 
ly| f ea io.” 


a.e. on T. 


We provide the idea of the proof. First consider statement (2). By taking the limit 
as r — 1 inT,f — T\/,f we prove the Fatou type result [31, Lemma 5.6]: 


Tif —T_f=wf ae.onT 


(with respect to the Lebesgue measure) for all f € L?(). Together with (38) 
and (37) we can use the representations (40) and (41) for g+ and g_ to see the 
desired result for the absolutely continuous part. 

Statement (1) uses Poltoratskii’s theorem [38, Theorem 2.7]. 


Clark Operator for Other a, \|a| = 1 


Consider the Clark operator ®y, : Ko, > L? (fq), Where [y, |] = 1 is the 
spectral measure corresponding to the cyclic vector b of the unitary operator Uy. 
Operator ®,, is a unitary operator, which intertwines the model operator Mo, and 
the c.n.u. contraction (U,), which is the operator U,, in the spectral representation 
of the operator U,. 

We deduce everything from the results we already obtained. First, let us write the 
c.n.u. contraction U,, |y| < 1 as a rank one perturbation of the unitary operator U,,, 
la| = 1: 


Uy =U4+ (y— Dbbt = U+ @— 1)bbt + (y — a)bbt = Uy + (v/a - 1)bb*, 
where b; = ab. 


From this we can just read off the results for aw € T from the results we already 
proved (fora = 1). 
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But to be consistent, we need the operators iy to agree. First we need them to 


the same model spaces, so let us fix the model spaces to be the ones we got for the 
case a = 1. Second, we want them to be consistent with respect to the operators V, 
from section “Unitary Rank One Perturbations”: 


O*, = OV, (42) 


Then an appropriately interpreted “universal” representation formula (Theorem 4.2) 
gives us a formula for ®7 ,. 
Namely, in the spectral representation of U, the c.n.u. contraction Uy is given by 


Me + (v/a — L)b™(b4)", (43) 


where b® = Vb, bf = Vb} = @V,b1, which yields b* = 1, bf (€) = &,é e T. 
Notice that 


Ov = OF bY = OVI = Ob =e", 
and that 
Gy? = OF bf = OV = a0), = ac}. 


Therefore, to get the formula for Oo with A = c’ (i.e. such that Me = 


c’) one just has to replace in (32) ww by fa, and c; by ac} (c” remains the same). 
Note, that as long as c” and G are computed, the parameter y does not appear 
in (32). 

Now let us get the representations in the Sz.-Nagy—Foias and de Branges— 
Rovnyak transcriptions. One of the ways to get the formula for oy would be to 
take the “universal formula” above and then repeat the proofs of Theorem 4.3 and 
of Theorem 4.4. 

But the there is a simpler (in our opinion) way, that allows us to get the result 
with almost no computations: one just have to “translate” Theorems 4.3, 4.4 to the 
spectral representation of Uy. 

In both these theorems the characteristic function and the parameter y are 
included explicitly, so we need to see how they change when we move to the spectral 
representation of Uy. 

If we want to apply know formulas (27), they give us the characteristic function 
Oe of the operator (43) with bf and b* taken for the basis vectors in the 
corresponding defect subspaces. 

So, by replacing w with fg and y with y/a in (27) and (24) we get the 
characteristic function and the defect given by 


Vie =a6,, and Aya = Ay. 
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Substituting these functions to (35) and replacing y there by y/a we get a 
representation formula for the adjoint of the Clark operator mapping L?({1g) > 
Kao, in Sz.-Nagy—Foiag transcription, 


1 lyl2y126* -( 0 ) Cee 7 
det aS e (7/a —(y/a— ITPA, on (v/a = l)Ay Tf 


(44) 


0 I-lyP 1 

1-y6 T%1 OL 
= 1-760 ya. ft + 1/2 Toh. 
(=, yaoi 74+ ms) (7/@ — 1) FE Ao 


where we let 74 f denote the non-tangential boundary values of Rf {1a (z), z € 
But the above formula is not yet the formula we are looking for! To get it we 
applied Theorem 4.3 with jt instead of yw and or a= @6, instead of 6,. But that 
means that the result in the right hand side there paloriss to Ka. So the above 
expression is an absolutely correct formula giving the representation of the operator 
®;,, in the model space Kgq,; that is why we used ®7 ,, and not ®7 ,, there. 
To get the representation with the model space Kg, we notice that the map 


gc) 

§2 ago 

is a unitary map from Kg¢, onto Kg,. Moreover, it maps the defect vector c given by 
equation (30) for the space Kao, to the corresponding defect vector c for the space 


Ko,. Therefore, to obtain the representation formula for ®) ,, we need to multiply 
the bottom entries in (44) by a, which gives us the following representation. 


Theorem 4.6 Operator Ly can be represented in the Sz.-Nagy—Foias transcrip- 
tion as 


(1 —|y)?)'?@% f = ( nee \r+ (i oe) Toy 


(y¥-(V-a@)TUA, (y—a)A, 
eas) en 
= | ~— 1-76 ca . ft ae diy Tif. 
T= 760] yoo) + mt) v= q) be Ao 


Few Remarks About Clark Theory for the Dissipative Case 


Consider a family of rank one perturbations similar to section “Self-Adjoint and 
Unitary Rank One Perturbations’, but with perturbation parameter a € C+ := {z€ 
C: Imz > 0}. In other words, in the spectral representation of A (with respect to the 
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cyclic vector g and spectral measure 4 = 4”) we study the family of perturbations 
given by 


Ag = M,+a(-,1), on L?(2) witha € Cy. 


Vu : 
Recall that we consider the extended class of form bounded rank one perturbations. 
In the spectral representation this condition is equivalent to 


| 2 ae 
ge i+ idl 

Without going into details on the definition of the perturbation in this case, we 
just say that one of the ways is to use the resolvent formula (3). 

While there is no “canonical” model for the dissipative operator, a widely 
accepted way is to construct the model for the Cayley transform T, = (Aq — i) 
(Ag +i). 

So, let us compute co introducing some notation along the way. 

Denote by U the Cayley transform of A = Ao, U = (A — i1)(A + iD). Using 
the resolvent formula (3) and denoting 


b:= A+ gl (A+ 9,  b):=|A-M "gl 14-9 
we can write 


Ty = Uy =U + (y—Wbi)*, 


where 


ye) | 6=@en7¢0< i. du) (45) 


gp stil 


If we denote 


F(z) := / au 


RS—Z 


we get that Q = F(—i) = F(i). 

Note also that |{b|| = {{b;|| = 1 and b, = U*D. It is obvious that y(a) € T for 
a € R. Since ImQ < 0, we conclude that y(w) € D for Ima > 0. Thus Ty is a 
contractive rank one perturbation of the unitary operator U. Under our assumptions 
about cyclicity of g, one can easily see that Disa *- -cyclic vector for U, so U is 
unitarily equivalent to the multiplication U = M; by the independent variable & in 
1? [,), where /1, is the spectral measure of U corresponding to the vector Dd. 

Let us fix some notation: for y = y(a) given by (45) we denote U, = Ty, and 
by U, = T, we denote the representation of the same operator in ibe Gad. In other 
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words, we use T in conjunction with the parameter a € C and U in conjunction 
with the parameter y = y(a@) € D; also we use ~ for the operators in L?(j1), and T 
and U act in L?(1,). 

The spectral measure j1, of U is easily computed. Namely, if w denotes the 
standard conformal map from C+ to D (and from R to T), 


z-i 4 +é 
o(z) := —, a) = i——., 
ay @) =i 
then one can easily see that 
as 1 du(x) 
1 
a ; here dii(x) = = 
[yp = LOW where dji(x) P'1¢x 
here by {20 @~! we mean that fio w~!(E) = fi(w7!(E)), E C T, and P := is ee 


What Is the Model for the Dissipative Case? 


As we mentioned above, it is customary for dissipative operator to consider the 
model for its Cayley transform. Using formulas (27) with 1, for w, and the above 
description of j4., we can write the characteristic function 6,, y = y(q). 

However, since our original objects live on the real line (in L?(j)), it is natural 
to consider the model also to be a space of functions on the real line. The standard 
unitary mapping Q : L7(T) > L7(R), 


1 
Qf (x) = Jaati ° (x) 


maps H?(D) onto H?(C) and so H2 (D) onto H2 (C). So if we use Q7! to transfer 
the model space Kg to the space of functions on R, the model space on the real line in 
Sz.-Nagy-—Foias and the de Branges—Rovnyak transcriptions will be defined exactly 
the same way as the model space on the circle. 

The multiplication M,, by the function w on R corresponds to the multiplication 
by € on T. 

Note also that (27/P)'/?Q is a unitary operator L?(j.) > L*(q) and that the 
map f +> f ow maps Ith) — L? (71) unitarily. 


Characteristic Function in the Half-Plane 


Let us now compute the characteristic function for Ag. For y = y(a) defined by (45) 
let 0, be the characteristic function of T, computed with respect to the vectors by 
and b. Let 
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be the transfer of 6, from the disc to the half-plane; note that we use capital © in 
conjunction with the parameter a € C,. 

Let us now transfer the Cauchy type integrals (26) to C4. For w € Cy letA = 
o@(w). Then 


Lemma 5.1 We have 


_ f de) i 1 I ie 
Rug (A) Sy bees = =f l—-l dl (x) =: Ru(w), 
(46) 

_ f §Adue (&) 1 1 1 de 
Ripg(A) = “1-8 a eh l= ~| dip (x) =: Riu), 
(47) 

_ Pith i i. ..& ae 
Raps) =f 1 ey er) = atl—= eet |te =: Ro(w). 
(48) 


Using formulas (27) for the disc we can write the characteristic functions as 


ee GQ=WPRieov) _ (= y)Roew) - (1+ 7) weCs. 


14+(-yYRimWw) U-y)Rouw(w) t+ (+7) ’ 
(49) 


Note that the formulas for Cy (y = 0, equivalently a = —1/ Q = —1/F(i)) are 
especially simple. And 6 is related to 0, by a fractional transformation: 


ee Dye =~ 6+ 
6, = — = or equivalently 6 = — = . 
1-76 1+ 76, 


Model and Defect Vectors in the Half-Plane 


Recall that the model operator Mg, is the compression of the multiplication 
operator M,, by the function w, 


Mg f = Prog Mol, fEky. 


Let us compute defect subspaces of Mg. 


Considering vectors c = c’ and cy = cr defined by (30), (31) with y,, instead 
of p, define ¢ := cow, C, := c, 0a, A:= Aow=1- (40 |2. Computing we get 
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in the Sz.-Nagy—Foias transcription 


ata = (1- Rory” (1 OBO), 


—6, (i) A(2) 
ae — -1 (9D _o j 
€1(z) = (1 - (2,0?) ue = pias )) 


Then the defect subspaces D and Mag of Mg, are spanned by the vectors 
ba 


M« 
oy 


(2) C1(z) 


Ja(z +i)’ Jn (z + i) 


and these vectors agree. 


Representations of the Adjoint Clark Operator in the Half-Plane 


Using these formulas we can transfer the universal representation formula given by 
Theorem 4.2 from the unit circle T to the real line R. For a function f on the real 
line R define f by 


F(x) := (x +i) -f(). 


and let f, := fo w'. Then we can easily transfer Theorem 4.2 from the disc D to 
the half-plane C+. 
To simplify the notation let us assume that the measure jz is Poisson normalized, 


i.e. that 
d 
P= f He) =1. 
pxetl 


Formulas for the general case P £ 1 can be then obtained if one notice that the map 
f > Pf is a unitary map L?(j1) > L?(/P). 


A Universal Representation Formula 


Theorem 5.2 (A “universal” representation formula for dissipative pertur- 
bations) Let the measure ft be Poisson normalized (P_ = 1). Let 6, be the 
characteristic function of Ty = U,, |y| < 1, computed with respect to the vectors b, 


Singular Integrals, Rank One Perturbations and Clark Model 129 


andb (note that 0, is given by (49)). Let KG, and M, = Mj, be the model subspace 
and the model operator respectively. Let o* LD? (pw) > KG, be the unitary operator 
satisfying 


O*U, = Mg %, 


and such that ®*b(z) =© (z)/(/a(z + i), ®£b1(2) =F (2)/(/a@ +0). 
Then for all compactly supported f € C'(IR) 


Vaz + i) O*f(2) = 
1 
sti 


= A, (z)f(z) + B, © | (f(s) —F(2)) = I- 


] axe) 
where A, (z) = (2), By (z) = @ (2) — wee? (2). 


A Representation Formula in the Sz.-Nagy—Foias Transcription 


For a measure ju on the real line define 7_f to be the non-tangential boundary values 
of the function 


~ 1 1 1 
Rf (Zz) = uP [ro |— ae | du.(s), Imz > 0, 
and let Tif be the non-tangential boundary values of 


o_o (s)du(s) 


Imz > 0; 
2iP S—Z 


R'fu@ = 
the non-tangential boundary values exist a.e. with respect to the Lebesgue measure 
by classical result about boundary values of the functions in the Hardy spaces H?. 


Theorem 5.3 Let 41 be Poisson normalized, P = 1. The operator o* can be 
represented in the Sz.-Nagy—Foias transcription as 


0 7 ( a : 
(Geom gna, J 


I-ly? 1 


0 ee 

_ aw 1-y@ T+1 1 

ee gee ae oe (Rye x | Th 
(irs Ao (y =) ar OO 


II 


Val = ly?) oF 


for f € L?(). 


The same recipe as above gives the representation in the de Branges-Rovnyak 
transcription, and a formula for the Clark operator ®,,. 
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Abstract We provide a quantitative two weight estimate for the dyadic paraproduct 
tp under certain conditions on a pair of weights (u, v) and b in Carl,,,, a new class 
of functions that we show coincides with BMO when u = v € Ag. We obtain 
quantitative two weight estimates for the dyadic square function and the martingale 
transforms under the assumption that the maximal function is bounded from L?(u) 
into L?(v) and v € RH?¢. Finally we obtain a quantitative two weight estimate from 
L?(u) into L7(v) for the dyadic square function under the assumption that the pair 
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(u, v) is in joint Ad andu—! € RH¢, this is sharp in the sense that when u = v the 
conditions reduce to u € Aq and the estimate is the known linear mixed estimate. 
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47B38 


Introduction 


We study quantitative two weight inequalities for some dyadic operators. More 
precisely, we study conditions on pairs of locally integrable a.e. positive functions 
(u,v) so that a linear or sublinear dyadic operator T is bounded from L7(u) into 
L’(v), that is there exists a constant Cr. > 0 such that for all f € L?(u), 


If llzw < Cruvlfllew> 


with estimates on Cr, involving the constants that appear in the conditions 
imposed on the weights and/or the operator. 

There are two current schools of thought regarding the two weight problem. 
First, given one operator find necessary and sufficient conditions on the weights 
to ensure boundedness of the operator on the appropriate spaces. Second, given a 
family of operators find necessary and sufficient conditions on the weights to ensure 
boundedness of the family of operators. In the first case, the conditions are usually 
“testing conditions” obtained from checking boundedness of the given operator on a 
collection of test functions. In the second case, the conditions are more “geometric”, 
meaning to only involve the weights and not the operators, such as Carleson 
conditions or bilinear embedding conditions, Muckenhoupt Az type conditions or 
bumped conditions. Operators of interest are the maximal function [43, 54, 55, 59], 
fractional and Poisson integrals [11, 56], the Hilbert transform [9, 10, 30, 32, 33, 
36, 49, 50] and general Calder6n—Zygmund singular integral operators and their 
commutators [12—14, 47], the square functions [7, 26, 34, 35], paraproducts and 
their dyadic counterparts [3, 20, 21, 28, 44]. Necessary and sufficient conditions are 
only known for the maximal function, fractional and Poisson integrals [55], square 
functions [34] and the Hilbert transform [33, 36], and among the dyadic operators 
for the martingale transform, the dyadic square functions, positive and well localized 
dyadic operators [17, 18, 25, 37, 38, 49, 51, 57, 58, 60-62]. If the weights u and v 
are assumed to be in Ag , then necessary and sufficient conditions for boundedness 
of dyadic paraproducts and commutators in terms of Bloom’s BMO are known 
[20, 21]. The assumption that a weight is in dyadic Ad is a strong assumption, 
it implies, for example, that the weight is dyadic doubling. On the other hand if 
the paraproduct is adapted to the weights u and v, then necessary and sufficient 
conditions for its boundedness from L?(u) into L?(v) are known [39] even in the 
non-homogeneous case, interestingly enough the conditions are different depending 
on whether 1 < p < 20rp > 2. 
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In this paper we obtain a quantitative two weight estimate for z,, the dyadic 
paraproduct associated to b, where b € Carl, a new class of functions that we 
show coincides with BMO“ when u = v € Ag . The sufficient conditions on the 
pair of weights (u, v) required in our theorem are half of the conditions required for 
the boundedness of the martingale transform, namely (i) (u, v) € AY (joint dyadic 
Az condition) and (ii) a Carleson condition on the weights, or equivalently, the 
conditions required for the boundedness of the dyadic square function from L7(v~!) 
into L?(u—!). 

In what follows D denotes the dyadic intervals, D(J) denotes the dyadic 
subintervals of an interval J, |J| denotes the length of the interval J, {h;};<-p denotes 
the Haar functions, mf := WW i f denotes the aise average of f over the interval 
I with respect to Lebesgue measure, and (f, g) := { fg denotes the inner product on 
L? (IR). We prove the following theorem. 


Theorem 1.1 Let (u,v) be a pair of measurable functions on R such that v and 
u!, the reciprocal of u, are weights on R, and such that 


(i) (u,v) € A%, that is [u, vad = supjep m(u-!) myv < 0. 


(ii) there is a constant D,,, > 0 such that 


y. |A;v|?|Z| mu!) < Dy»vVJ) forall J €D, 
1eD(J) 


where A;v := my, v —m_v, and I. are the right and left children of I. 


Assume that b € Carly, that is b € L!} 


ioeR) and there is a constant By, > 0 
such that 


b, hj)|? 
3 He, hal” <B,yw'V) forall €D. 
MV 
1ED(J) 


Then sy, the dyadic paraproduct associated to b, is bounded from L?(u) into 
L?(v). Moreover, there exists a constant C > 0 such that for all f € L?(u) 


ref le2) S Cy/le v1 ag Buo( [le vLag + VDaw )lflli2ey 


where Tf := Doren muf (b, hi) hy. 


When u = v = wi the conditions in Theorem 1.1 reduce to w € Ag and 
b € BMO%, but we do not recover the first author’s linear bound for the dyadic 
paraproduct [1], we are off by a factor of [Ww] en - = . In [44, 45] similar methods yield 


the linear bound in the one weight case, bat. there i is a step in that argument that 
can not be taken in the two weight setting. More precisely, in the one weight case, 
u =v =w, we have ww! = 1 ae. and 1 < m;wm,(w—!); in the two weight case 
we can no longer bound vu! nor can we bound m,;v m;(u7') positively away from 
zero. 
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We compare the known two weight results for the martingale transform, the 
dyadic square function, and the dyadic maximal function. Assuming the maximal 
operator is bounded from L?(u) into L?(v), and under the additional condition 
that v is in the RH‘ class, we conclude the other operators are bounded with 
quantitative estimates involving the operator norm of the maximal function and 
the RH‘ constant. Notice that the boundedness of the maximal function implies 
that the weights (u,v) obey the joint Ag condition, but this is not sufficient for 
boundedness neither of the martingale transform nor the dyadic square function. 
Finally we obtain quantitative two weight estimates for the dyadic square function 
when (u,v) € Ae and u7! is in RH¢. This extends work of the first author [2] where 
similar quantitative two weight bounds were obtained under the stronger assumption 
that u~! € A‘ for some q > 1 (in other words, ule A‘). 


Theorem 1.2 Let (u,v) be a pair of measurable functions such that (u,v) € AY 
andu!«€ RH?¢. Then there is a constant such that 


SUi2e>220) S Cle Wa + pea): 

When the two weights equal w the conditions in Theorem 1.2 reduce to w € 
Ag and we improve the sharp linear estimates of Hukovic et al. [22] to a mixed 
linear estimate. Compare to one weight mixed type estimates of Lerner [41], and two 
weight strong and weak estimates in [7, 26, 34] where similar estimates are obtained 
for the g-function and Wilson’s intrinsic square function [63]. In the aformentioned 
papers, both weights are assumed to be in Ago. 

The one weight problem, corresponding to u = v = w is well understood. In 
1960, Helson and Szeg6 [19] presented the first necessary and sufficient conditions 
on w for the boundedness of the Hilbert transform on L7(w) in the context of 
prediction theory. They used methods involving analytic functions and operator 
theory. The two weight characterization for the Hilbert transform in this direction 
was completely solved by Cotlar and Sadosky in [9] and [10]. The class of A, 
weights was introduced in 1972 by Muckenhoupt [46], these are the weights w 
for which the Hardy-Littlewood maximal function maps L?(w) into itself. We say 
the positive almost everywhere and locally integrable function w satisfies the A, 
condition if and only if 


p—1 
[Ww], = eu (7 [vcoas) (| [vv eoar) <0, 


where [w]4, denotes the A, characteristic (often called A, constant or norm) of the 
weight. In 1973, Hunt, Muckenhoupt, and Wheeden [23] showed that the Hilbert 
transform is bounded on L?(w) if and only if w € Ap. Also, in 1973, Coiffman and 
Fefferman [8] extended this result to the classical Calder6én-Zygmund operators. 
When u = v = wi the joint A, condition coincides with Az. The joint A, 
condition is necessary and sufficient for the two weight weak (1,1) boundedness 
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of the maximal function but is not enough for the strong boundedness [55]. In 
1982 Sawyer found necessary and sufficient conditions on pairs of weights for the 
boundedness of the maximal function, namely joint A and the testing conditions 
[55]. In the 1990s the interest shifted toward the study, in the one weight case, of 
the sharp dependence of A, characteristic for a general Calder6n-Zygmund operator 
on weighted Lebesgue spaces L?(w). In 2012 Hyt6énen proved the A-conjecture 
(now theorem), see [24, 31]: Let T be a Calder6n-Zgmund operator and w be an A 
weight then 


ZF llz20 SC wlarllAll 2c » 


where the constant C depends only on the dimension d, the growth and smoothness 
of the kernel of T, and its norm in the non-weighted L?. From sharp extrapolation 
[15] one deduces that for 1 < p< co, andw €A,, 


{1,1/(p—1)} 
IZ llr) < Carp Oe flv. 


After these groundbreaking results, improvements were found in the form of mixed 
type estimates such as the following L?(w) estimate 


IZFlli20) < Ch, (ae + Ye UF liz200; 
where AZ, = WeegAe , and [w] jad, 1s the Hruscev constant or is replaced by the 
smaller [w] rug aS We do in this paper, see [25, 27] and [42, 54] for other variations. 
Currently a lot of effort has been put into finding two weight analogues of these 
estimates as described at the beginning of this introduction. In this paper we present 
two weight quantitative and mixed type estimates for the dyadic paraproduct, 
martingale transform, and the dyadic square function. 

In this paper we work in R but the results should hold in R¢ and in spaces of 
homogeneous type. 

Preliminary definitions and results are collected in section “Definitions and Fre- 
quently Used Theorems”, including joint A%, regular and weighted Haar functions, 
w-Carleson sequences, the class Carl,,,, the class RH? and its quantitative relation 
to A4,, weighted Carleson’s and Buckley’s Lemmas. The main dyadic operators 
are introduced in section “Dyadic Operators and Known Two Weight Results”: 
dyadic maximal function, dyadic square function, martingale transform and the 
dyadic paraproduct, we record the known two weight results for these operators. 
In section “The Dyadic Paraproduct, Bump Conditions, and BMO vs Carl,” we 
prove our quantitative two weight result for the dyadic paraproduct, we also show 
that when vu = v € Ag then Carl, = BMO‘. We compare our conditions to 
bumped conditions and argue that neither result implies the other, we also compare 
Carl, to the Bloom BMO and related conditions. In section “The Maximal and 
the Square Functions” we obtain some quantitative two weight estimates for the 
dyadic square function and the martingale transforms under the assumptions that 
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the maximal function is bounded and the additional assumption v is a weight in 
RH. In section “The Sharp Quantitative Estimate for the Dyadic Square Function” 
we obtain a sharp two weight estimate for the dyadic square function under the 
assumptions that (u,v) € Az and u7! € RH?¢. 

The authors would like to thank the referee for thoughtful comments, and for 
enticing us to explore in more depth the Bloom BMO condition and compare it to 
Carl,,,. The authors would also like to thank Jethro van Ekeren, a friend of the third 
author and a native English speaker, who proofread the article. 


Definitions and Frequently Used Theorems 


Throughout the proofs a constant C will be a numerical constant that may change 
from line to line. The symbol A, < B, means there is a constant c > 0 independent 


Nr 


of n such that A, < cB,, and A, ~ B, means that A, < B, and By S An. 
Given a measurable set E in R, |E| will denote its Lebesgue measure. We say that 
a function v : R — R is a weight if v is an almost everywhere positive locally 
integrable function. For a given weight v, the v-measure of a measurable set E, 
denoted by v(E), is v(E) = f, 7 U(x)dx. We say that a weight v is a regular weight 
if v((—o0,0)) = v((0,0o)) = ov. Let us denote D the collection of all dyadic 
intervals, and let us denote D(J/) the collection of all dyadic subintervals of J. 

We say that a pair of weights (u, v) satisfies the joint Ag condition if and only if 
both v and u“!, the reciprocal of u, are weights, and 


[u, v}ad — sup miu ') mv < oO, (1) 
E 


where mv stands for the integral average of a weight v over the interval J. Note that 
(u,v) € Ag is equivalent to (v~!, uw!) € A¥ and the corresponding constants are 
equal. Similarly a pair of weights (u, v) satisfies the joint A’ condition iff 


-1 
u,v] ga = Sup m;(uP-")P|! mv < oo. 
[u,v] A pm,( I 

Hi IED 


Note also that (v, v) € As coincides with the usual one weight definition of v € A‘. 


Haar Bases 


For any interval J € D, there is a Haar function defined by 


hia) = Fo (Hiy(0) ~ 00), 


On Two Weight Estimates for Dyadic Operators 141 


where 1; denotes the characteristic function of the interval J, and J, J_ denote the 
right and left child of J respectively. For a given weight v and an interval J define 
the weighted Haar function as 


pean, tl | v(_) _ fvds) 
hy (x) = Vo) ( vy) v(I_) 1) . 


The space L”(v) is the collection of square integrable complex valued functions 
with respect to the measure du = vdx, it is a Hilbert space with the weighted inner 
product defined by (f, g), =  fgvdx. It is a well known fact that the Haar systems 
{hjiep and {h7};ep are orthonormal systems in L?(R) and L?(v) respectively. 
Therefore, for any weight v, by Bessel’s inequality we have the following: 


DIG AP ol? < Mize: 


IED 


Furthermore, if v is a regular weight, then every function f € L?(v) can be 
written as 


f= DOF Mobi, 


IED 


where the sum converges a.e. in L*(v), hence the family {h?};ep is a complete 
orthonormal system. Note that if v is not a regular weight so that v((—oo,0)), 
v((0, oo)), or both are finite, then either 1(¢,9), 10,00), or both are in L?(v) and 
are orthogonal to h7 for every dyadic interval J. 

The weighted and unweighted Haar functions are related linearly as follows: 


Proposition 2.1 ([49]) For any weight v and every I € D, there are numbers a;, 
B; such that 


1;(x) 


hy(x) = a7 hy (x) + Br 7 


where (i) |a?| < mv, (ii) |B?| < 22, and Aju := mp, v — mj_v. 


— mv’ 


Dyadic BMO 


A locally integrable function b is in the space of dyadic bounded mean oscillation 
(BMO‘) if and only if there is a constant C > 0 such that for all J € D one has 


[ioe — my,b| dx < Cll]. 
I 
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The smallest constant C is the BMO‘-norm of b. The celebrated John-Nirenberg 
Theorem (see [53]) implies that foreach 1 < p< w,be BMO‘ iff 


1 
b\\? = — | |b(x) — myb\Pdx < co. 
NOVavoy = SU 7 | [BGs) — mblPax < 00 


Furthermore ||b|| Mod is comparable to the BMO-norm of b. 
In this paper we will mostly be concerned with p = 2 and we will declare 


JOlnwon != Wellawog = sap (=f JC) ~ mole) | 
Lemma 2.2 [fb € BMO% then 


Fie —_ A “ae I( (b, hy)|? 
fl sean 


Proof The family {hy};ep() is an orthonormal basis of the space L5(I) := {f € 
L?(1): ff = 0}. The function (b — m;b)1, € L5(I), hence by Plancherel 


[09 - mb?-dx = ~ |{b,hy)/?. 
JED(I) 


This proves the lemma. oO 


In other words, b € BMO® if and only if there is a constant C > O such that for 
allleD 


S~ |(b,hs)? < Cll. 


JED(I) 


Carleson Sequences 


A positive sequence {A;}rep is a v-Carleson sequence if there is a constant C > 0 
such that for all dyadic intervals J 


do Ars Co). (2) 


1ED(J) 


When v = | almost everywhere we say that the sequence is a Carleson sequence or 
a dx-Carleson sequence. The infimum among all C’s that satisfy the inequality (2) 
is called the intensity of the v-Carleson sequence {A;};ep. For instance, b € BMO4 
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if and only if {|(b,/7)|?}:ep is a Carleson sequence with intensity ||b]|? aioe Le 
following lemma gives a relationship between unweighted and weighted Carleson 
sequences. 


Lemma 2.3 (Little Lemma, [1]) Let v be a weight, such that v~' is also a weight, 
and let {a;;ep be a Carleson sequence with intensity B then {ot;/m;(v—')sep is a 
v-Carleson sequence with intensity at most 4B, that is for all J € D, 


1 Qy] 


— —— < 4Bmyv. 
lJ| mj(v~") 


IeD(J) 


We also need to define a class of objects that will take the place of the BMO“ 
class in the two weighted case, we will call this class the two weight Carleson class. 


Definition 2.4 Given a pair of functions (u, v) such that v and u~! are weights, we 
say that a locally integrable function b belongs to the two weight Carleson class, 
Carlyy, if {\b7|?/mpv}rep is a u~!- Carleson sequence where b; = (b, hy). 


Note that if u = v, then we have that b € Carl,., iff {|b;|?/mjv}rep is a v—!- 
Carleson sequence. The later statement is true if {|b;|*};ep is a Carleson sequence 
(by Lemma 2.3), which in turn is equivalent to saying that b € BMO“. Therefore for 
any weight v , such that v~! is also a weight, we have that 


BMO! C Carly». 


Moreover, if B,., is the intensity of the v~!-Carleson sequence {|b,|?/m;v}jep 
then B,,, < 4|b||? BMol In section “The Dyadic Paraproduct, Bump Conditions, 
and BMO vs Carl,” we will show that if v € Ad then BMO?¢ = Carly, OL? (R) 
(see Corollary 4.6). 

We now introduce some useful lemmas which will be used frequently throughout 
this paper. You can find proofs in [45]. The following lemma was stated first in [49]. 


loc 


Lemma 2.5 (Weighted Carleson Lemma) Let v be a weight, then {ar\rep 
is a v-Carleson sequence with intensity B if and only if for all non-negative 
v-measurable functions F on the line, 


dilint F@))ar < <B i F(x) v(x) dx. (3) 


IED 


In relation to Carleson sequences we consider another class of weights which is 
called the Reverse Hélder class with index 1 and is defined as follows. 


Definition 2.6 A weight v belongs to the dyadic Reverse Hilder class RH‘ 
whenever its characteristic [v] Rud is finite, where 


v v 
[U] en = sup (to —) < 00. 


myv mv 
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It is well known that v € Aq if and only if v € RH. In the dyadic case, v € RH¢ 
does not imply that v is dyadic doubling, however v € A%, does. See [53] for 
more details. Recently, the first author and Reznikov obtained, in [4], the sharp 
comparability of the A“, and RH¢ characteristics. The RH¢ characteristic is also 
known as the Fujii-Wilson Ae characteristic, see for example [54, Equation (2.5)] 
and references therein. 


Theorem 2.7 ((4]) Jfa weight v belongs to the AS, class, then v € RH? . Moreover, 
[U] end = log(16)[v] 4a. 


The constant log(16) is the best possible. 


We would also like to note here that results of Iwaniek and Verde [29] show that 
[When © SUPjep ae , where ||-|| oz), stands for the ®(L)-Luxemburg norm (for 
more details see [4]). In the same paper you can find the following characterization 


of the L log L-norm (Part (a)) and a sharp version of Buckley’s theorem (Part (b)). 


Theorem 2.8 (a) [4, Theorem II.6(2)] There exist real positive constants c and C, 
independent of the weight v, such that for every weight v and every interval J 
we have 


v 1 |A;v|? ( ( v )) 
1 < N<c 1 — 4 
em, (: og (~.)) Ti > my HLS Cm (vlog (4) 


IeD(J) 


2 
and as a consequence |\v||Logts © iW Vredu) ul II. 


(b) Let v be a weight such that v € RH@. Then {\|A;v|7|I|/myv}1ep is a v-Carleson 
sequence with intensity comparable to [v] RH¢: That is, there is a constant C > 0 
such that for any J € D, 


1 |A;v|? 
VI > ie IZ] < Clulayymyy. 
leDU) 


Dyadic Operators and Known Two Weight Results 


We now introduce several dyadic operators which will be considered in this paper, 
and record known two weight results for them. 


Dyadic Weighted Maximal Function 


First we recall the dyadic weighted maximal function. 
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Definition 3.1 We define the dyadic weighted maximal function M¢ as follows 


1 
Mef(x):= sup — | [F091 voydy 
[Dx v() I 
IED 

The weighted maximal function M, is defined analogously by taking the 
supremum over all intervals not just dyadic intervals. A very important fact about 
the weighted maximal function is that the L?(v) norm of M¢ only depends on 


p' = p/(p — 1) not on the weight v. 


Theorem 3.2 Let v be a locally integrable function such that v > 0 a.e. Then for 
alll<p<«o, MM is bounded in L?(v). Moreover, for all f € L?(v) 


Migllee) S Cplifllrey- 


This result follows by the Marcinkiewicz interpolation theorem, with constant 
Cp = 2p’), using the facts that M@ is bounded on L®(v) with constant 1 and 
it is weak-type (1, 1) also with constant 1. Note that as p > 1, C,/p' — 2 and 
Cy = 24/2. 


When v = 1, M, is the maximal function that we will denote M. In [6], Buckley 
1 


showed that the L’(w) norm of M behaves like [w] a , in particular the L?(w) norm 
of M depends linearly on the Az charateristic of the weight. The next theorem is 
Sawyer’s celebrated two weight result for the maximal function M in the case p = 2. 


Theorem 3.3 ({55]) The maximal function M is bounded from L*(u) into L?(v) if 
and only if there is a constant Cy > 0 such that 


[rane v0 dx < C,,u (1), for allintervals I. (5) 
I 


A quantitative version of this result was given by Moen, he showed in [43] that 
the operator norm of M from L?(u) into L?(v) is comparable to 2C,,,. Note that 
Sawyer’s test condition (5) implies (u, v) € Az, moreover [u, v].4, < Cyy- 

A quantitative two weight result for the maximal function not involving Sawyer’s 
test conditions, instead involving joint A, and RH; constant of u—', has been 
recently found by Hy6nen and Pérez [27], see also Pérez and Rela [54]. 


Theorem 3.4 ((54, Corollary 1.4])! Let u and v be weights such that (u,v) € Az 
and u~! € RH, then 


‘Note that in [54, Corollary 1.4] the statement is not exactly this one. The authors are using a well- 
known change of variables that we are not using in this paper: their w is our v, their o is our u—!, 
their [w, o] jag corresponds to our [o—!_ w] jag and hence equals to our [u, v],. Finally in their case 
M(-c) acts on g € L?(c), in our case M acts on f € L?(u), and clearly g € L?(o) = L?(u!) if 
and only if f = go = gu! € L?(u) with equal norms. 
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|Mllz2()>22@ < CCH, v].A, [0 'Te, ye, 


This result is valid in certain spaces of homogeneous type, see [54]. In fact they 
prove a result valid in L’ replacing joint Az by joint A, and the power 1/2 by 1/p. 
More precisely they show 


Mllixw)>u@) < Cp’ ((u, v4, [u "Ten, )””, 


where p’ = p/(p — 1) is the dual exponent to p. 


Dyadic Square Function 


Second, we introduce the dyadic square function. 


Definition 3.5 We define the dyadic square function as follows 


1/2 
S4f(x) = (x Inf — mfi*4))) 


IeD 


where / denotes the dyadic parent of I. 


In [22], Hukovic, Treil and Volberg showed that the L?(v) norm of S¢ depends 
linearly on the A» characteristic of the weight. Cruz-Uribe, Martell, and Pérez 


[13] showed that the L7(v) norm of S¢ depends on [vl *- One concludes that 


SF low < Cluly, lf llzx@) by sharp extrapolation [15], this bound is 
optimal. Lerner [40] has shown that this holds for Wilson’s intrinsic square 
function [63]. 

The following two weight characterization was introduced by Wilson, see 


also [49] 


Theorem 3.6 ((62]) The dyadic square function S¢ is bounded from L*(u) into 
L?(v) if and only if 


(i) (u,v) € Ad 


(ii) {\Z| |Ayu7! |? myv}ep is au~!-Carleson sequence with intensity Cy, . 


ax{ 5.547} 


Condition (ii) can be viewed as a localized testing condition on the test functions 

-1 dy|2 
u- 1, for J € D. Thus, Cy < ||S leq) +20)" 
Recently Lacey and Li [34] showed a continuous quantitative analogue of this 
theorem and they claim the dyadic version is “a direct analog of their theorem”, 


their estimate would read 


IS“ lleaq@220) = ((u, v) ad + Cay (6) 
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We will present a proof of this estimate in section “The Sharp Quantitative 
Estimate for the Dyadic Square Function”. We will get quantitative two weight 
estimates for the dyadic aS function involving either the ah weight norm of the 


or [u,v]? Fa le and [v]!/ 


maximal operator and [uy re > 
Ad RH’ 


he , under appropriate 


eat ? 

assumptions in each case. 
Theorem 2.8(b) implies that if ule RH¢ and (u,v) € Ag then condition (ii) in 

Theorem 3.6 holds with C,, S [u, v] Adu iy Rut As a corollary of (6) we get that if 


u-! € RH¢ and (u,v) € Ag then 
ISI 2y-o220y S CCl og + [es v1 agleag)!? a 


This improves [2, Theorem 4.1] where the stronger assumption u~! € Aa for 
some g > | was made and a similar quantitative two weight estimate was obtained 
with [u~ "aa replacing [u~'] rut and the constant C depending on q. Her results 
are proved in a setting where the underlying Lebesgue measure is replaced by a 
doubling measure o on R (a space of homogeneous type), introducing a dependence 
on the doubling constant of o which is tracked in the aformentioned theorem. We 
will prove (7) without relying on (6) in section “The Sharp Quantitative Estimate 
for the Dyadic Square Function”. A closer look shows that the same argument will 
allow us to recover (6). When uu = v = w e€ Ag this improves Hukovic’s linear 
bound to a mixed bound: 


IS“ IL2~») = Cw] 4 [wo Tene). 


Martingale Transform 


Third, we introduce the martingale transforms. 


Definition 3.7 Let r be a function from D into {—1, 1} so that r(7) = ry, then we 
define the martingale transform T, associated to r, acting on functions f € L?(R), by 


T.£@) = be rf, hy)hy(x). 


IED 


In [64], Wittwer showed that the L*(w) norm of 7, depends linearly on the A> 
characteristic of the weight w. The next theorem is from [49] and it gives necessary 
and sufficient conditions for the martingale transforms 7,. to be uniformly bounded 
from L?(u) into L*(v). Before we state the theorem, let us define the positive 
operator 


Tef() = y= Ws ue 


IED 


— [Ar [Ara ae) I|. 


where a 1 
mv my(u—") 
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Theorem 3.8 ([49]) The martingale transforms T, are uniformly bounded from 
L?(u) to L’(v) if and only if the following four assertions hold simultaneously: 


(i) (u,v) € Ag 
(ii) {\Z| |Apu! |? mv ep is a u~!-Carleson sequence. 
(iii) {|Z| |Ayv|?m;(u7!) ep is a v-Carleson sequence. 
(iv) The positive operator Ty is bounded from L*(u) into L?(v). 
As a corollary of the previous results in this section we can rewrite Theorem 3.8 
as follows, 


Corollary 3.9 The martingale transforms T,. are uniformly bounded from L?(u) to 
L’(v) if and only if the following three assertions hold simultaneously: 
(i) S4¢ is bounded from L?(u) into L?(v). 
(ii) S¢ is bounded from L*(v~') into L?(u7'). 
(iii) The positive operator Ty is bounded from L?(u) into L?(v). 


Dyadic Paraproduct 


Finally we recall the definition of the dyadic paraproduct. 


Definition 3.10 We formally define the dyadic paraproduct z, associated to b € 
L},-(IR) as follows for functions f which are at least locally integrable: 


mf (x) = Smif (b, hi) hi(x). 


IeD 


It is a well know fact that the dyadic paraproduct is bounded not only on L? (dx) 
but also on L?(v) when b € BMO®4 and v € a Beznosova proved in [1] that the 
L’(v) norm of the dyadic paraproduct depends linearly on both [v] ad and ||B||ayo4- 
Sharp extrapolation [15] then shows : 


max{1, 


1 
pt 
taf rw S CUPllamo«yg ?* Uf llaron- 


When both weights u,v € Ag then it is known that the boundedness of the dyadic 
paraproduct zt, : L?(u) — L?(v) is equivalent to b being in a weighted BMO“%(,1) 
where pz = u!/Py—!/?, that is, 


1 
b d(,) = SU <p | Ibe) — mb dx < 00. (8) 
ll>\lamo (u) a (1) p ( 1b| 


This space is known as Bloom’s BMO [5]. In fact there are a number of conditions 
equivalent to (8) (see [20]) one of them being the boundedness of the adjoint of the 
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dyadic paraproduct 2 : L?(u) > L?(v). By duality the last result is equivalent to 
the boundedness of the dyadic paraproduct zm, : L’’(v’) > L’ (wv), where p,p’ are 
dual exponents, p ta = = |, and uv’, v’ are dual weights, namely u’ = wrt = PP, 
Not surprisingly pw’ = (v’)!/"(w’)~'/?" = ys, so that BMO(w') = BMO(). The 
assumption that both weights are in rs is very symmetric and forces boundedness 
of the paraproduct and its adjoint to occur simultaneously. This is the appropriate 
setting when dealing with two-weight inequalities for commutators which very 
naturally can be separated into commutators with a paraproduct, its adjoint, and 
other terms which will all be bounded from L?(u) into L?(v) provided u, v € Ae and 
b is in Bloom’s BMO(jz). Assuming both u,v € Ag allows one to use Littlewood- 
Paley theory for the dyadic square function S“, specifically, the L?(w) norm of 
is comparable to the L?(w) norm of g whenever w € a In particular || 7,/|| 
comparable to ||S¢(zpf) leq = 
from L’(u) into L?(v) of the dyadic paraproduct is reduced to verifying the following 
estimate 


Dv) | 
rep |nuf ?b7m,(v), ai from here boundedness 


Do lmuf PojmM (0) < Curollfllizey: 


IED 


This inequality holds by the weighted Carleson lemma (Lemma 2.5) and the 
boundedness of the maximal function in L?(u) when u € A‘, provided the sequence 
{b?m(v)}rep is a u-Carleson sequence, namely 


S> bymi(v) < Cu). (9) 


1eD(J) 


Another use of the Littlewood-Paley theory (v € Az) allows us to compare the 
left-hand-side tof, ee = m,b\|*v(x) dx yielding what turns out is an equivalent 
condition for the boundedness of the paraproduct from L?(u) — L?(v) when u,v € 
Ag (see [20]) 


WP oH rey 5 [be — myb\?v(x) dx < oo. (10) 


In [21, Theorem 3.1] the authors present an equivalent condition for the bounded- 
ness of the paraproduct from L?(u) > L?(v) when only v € Ag, namely 


Bo(u, v) = SUP Gh aa S> be(mu-')?m,(v) < 00. (11) 
IeD(J) 


Conditions (10) and (11) are testing conditions for the test functions uo! Ly. 
It should be noted that the dyadic paraproduct is a well-localized operator (and 
for trivial reasons) in the sense of Nazarov, Treil and Volberg [51]. Therefore the 
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known necessary and sufficient testing conditions for well-localized operators apply 
in this case, these testing conditions involve the two weights and the function b 
(B. Wick, Personal communication. April 2016). 

In section “The Dyadic Paraproduct, Bump Conditions, and BMO vs Carl,” 
we provide sufficient conditions on a pair of weights (u,v) for the two weight 
boundedness of the dyadic paraproduct operator from L?(u) into L?(v) when 
b € Carl,,, together with a quantitative estimate. The conditions we consider 
are less symmetric, we assume a priori that (u,v) € AY (which is equivalent to 
wu) e AS), and an assymetric weighted Carleson condition, or equivalently 
we assume the dyadic square S$“ function is bounded from L?(v—!) > L?(u7!). 
Under these conditions we show that if b € Carl, then 2, is bounded from 
L?(u) into L?(v). We would have liked to show that b € Carl,,, is not only a 
sufficient condition but also a necessary condition for the boundedness of the dyadic 
paraproduct under the a priori assumptions on the pair of weights, but we have not 
been able to identify the appropriate testing functions that will yield this result. 
If we wish to show that both the paraproduct and its adjoint are bounded from 
L?(u) into L?(v) then we need to assume a priori joint A, and two mixed Carleson 
conditions on the weights, and we need to assume b € Carl,  Carl,—1 ,-1. It 
will be interesting to compare these conditions, for example can one show that if 
u,v € Ag then Bloom’s BMO coincides with b € Carly, M Carl,-1 ,-1? Can we 
conclude that when (u, v) € AY then Carl, is equivalent to B,(u, v) < co? or that 
when v € Ag then Carl, , is equivalent to B)(v,v) < oo? We record some results 
comparing these conditions in section “Carl,,,, vs Bloom’s BMO”. 


The Dyadic Paraproduct, Bump Conditions, 
and BMO vs Carl, 


In this section we will state and prove our main two weight result about the dyadic 
paraproduct (Theorem 1.1 in the introduction, called Theorem 4.1 in this section). 
We will also compare our result to known two weight bump conditions, compare the 
class Carl, ., with BMO® when v € A, and compare the class Carl,  Carl,-1 ,-1 
with Bloom’s BMO when both u,v € Aq. 


Two Weight Estimate for the Dyadic Paraproduct 


In this section we obtain quantitative two-weight estimates for the dyadic paraprod- 
uct 2, when b € Carl,,, and (u, v) are two weights with some additional conditions. 
Note that by definition b is a locally integrable function, thus b; = (b,h;) is well 
defined. The next theorem is a literal restatement of Theorem 1.1 which we provide 
for ease of the reader. 
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Theorem 4.1 Let (u,v) be a pair of functions such that v and u“' are weights, 
(u,v) € AY, and {|A;v/?|I|mp(u7!) ep is a v-Carleson sequence with intensity 
Dy». Then my is bounded from L?(u) into L?(v) if b € Carl,,. Moreover, if By,» is 
the intensity of the u-!-Carleson sequence {|b;|"/mjv}\1ep then there exists C > 0 
such that for all f € L?(u) 


Il of llr2~v) = Clu. V)gBuw (fe, v} ad + Vv Dy ) All rz): 


Proof Fix f € L?(u7') and g € L*(v), then fu“! € L?(u), |lfu' law = Wf lla. 
gu € L*(v7') and |gv|l2@—-1) = IIgllz2@y, 70 (fu ') is expected to be in L*(v) , then 
gu € L?(v_!) is in the right space for the pairing. Thus, by duality, suffices to prove: 


[(o(fu-"), gv)| = Cy/la, v] gBuo( s/t, vag + Vac ) Uf lli2e-yliglli2e. 12) 


Replace h; by a;h; + biag where a; = a; and B; = 7 as described in 
Proposition 2.1, to get 


[(scp(fu), gv)| < > [balmy (flu) 


IED 


1 
(ev. cnn} + pe) (13) 


Use the triangle inequality to separate the sum in (13) into two summands 


[(xp(fie-), gv) < o Nbrllarlmg ple ihgo, AB) + ST Noy aed CUfle gv. 11) 
IED IED vial 


[Arv| 
myv 


Using the estimates |a,| < ./m;v and |6;| < in Proposition 2.1, we have that, 


[(,(fu-'), gv)| < D1 + Xo, 


where 


y= sy [bel ([flu“") (gu, hy) | Vv 


IED 


Eo := Do lori (lflu I (gv, 1) ot 


IED mv 4/|I| 


Estimating ©): We have 


|bi| 


-1 
< mi (If|)|(g. 27) vm! mv 
jeD VITUV 
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|br| : v 
Suv ag Dae ink Ma f0DI i 


IED 
\b1 : 1/2 : 1/2 
= lwtlae( Oe nkMe vt) (Sheen) 
IED IED 


Here in the first line we use that (gv,f} = (g,f)y, in the second line we use that 
m'|f| = mu) < M,-1f (x) for all x € I, and that m;(u7!)myv < [u, v] gq and 


my(u—!) 
in the third line we use the Cauchy-Schwarz inequality. 
Using the fact that {h7}7ep is an orthonormal system in L°() and the Weighted 
Carleson Lemma 2.5, with F(x) = M?*_,f(x), and a = |b;|?/myv, which is a u~!- 


Carleson sequence with intensity Biss by assumption, we get 


1/2 
Bi < (uray VBuo( f ME veuw ear) Mtoe 
= 2V2[u, v} ag Vv Buy» All 2~u-1) lgllz2q): (14) 


In the second inequality we used Theorem 3.2. 
Estimating & : Using similar arguments as the ones used for X; , we conclude 
that, 


Bo < Yo [bili (Ufpm¥ (lel) el Vili! — 


IED 


Sait (fDi lero] Vfl) imo 


te Vv 


[b/| . ; 
< [u, vd ——|Ayv| yd] vmu-" inf M, if (x) inf My g(a) 
- TeD vel ° 


1/2 
tole |bil* inf M2) (2 arelFm Ging 209) 


myv xel 
IED 


1/2 


By hypothesis {|b;|?/myv};ep is au” !-Carleson sequence and {| A;v| ||m;(u7!) }ep 
is a v-Carleson sequence with intensities 6,, and D,,, respectively. By 
Lemma (2.5), 


1/2 1/2 
Xo s [u, UV] 44Buy Puy ( M?_f (xu! (od) ( / Mig(s)u(sar) 
V 7 R R 


[u, UV) 44 Buv Puy Mi U2) IIMvallr2@) 


[u, UV) 42 Buy Pu All2@— lg llzz@- 
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This estimate, together with estimate (14), gives (12). oO 


We can replace the conditions on the pair (u,v) by boundedness of the dyadic 
square function to deduce boundedness of the dyadic paraproduct when b € Carl,,,. 
Corollary 4.2. Let b € Lj,,(R) and (u,v) be a pair of functions such that v and 
u_! are weights and {\b;|"/mjv}1ep is a u—'-Carleson sequence (b € Carly») with 
intensity B,,y. If the dyadic square function S“ is bounded from L?(v~') into L?(u7') 
then the paraproduct my is bounded from L?(u) into L?(v). Moreover 


ref lz) < Cyl, Vag Buw( [le vLag + ISl20-5 20-4 lille: 


Proof Assume S¢ is bounded from L?(v~') into L?(u~!). Theorem 3.6 implies 
that (u,v) € Az and {|A;v|?|J|m;(u~!)}rep is v-Carleson sequence with intensity 
C,-1,-1. Moreover, C,-1,-1 < IS Meo (1) +2207): These two facts together 
with the hypothesis that {|b;|?/m;v}jep is a u~'-Carleson sequence imply, by 
Theorem 1.1, that sz, is bounded from L?(u) to L?(v). The claimed estimate holds. 

oO 


If we especialize to the one weight case u = v = w € A§ then S4lz2@-1 < 
Clw!] ad = C[w],g- Moreover, b € Carlyy O Lj, is equivalent to b € BMO“ and 


Buy < C lelleuoe we show this in Corollary 4.6. The previous Corollary would 
give us that 


3 
Ilo |lz2¢)>22~) S CllAllanco LW] ja 


aL. 
Thus, we do not recover Beznosova’s linear bound, we are off by [W] ia: 
2 


Comparison to One-Sided Bump Theorems 


The dyadic paraproduct is especially interesting because it allows us to estimate 
Calder6n-Zygmund singular integral operators (CZSIO). The general approach to 
the two weight estimates for the CZSIO as a class is a bump-approach. We refer the 
reader to [48] for the precise definitions and statements, the interested reader can 
also consult [59] in this volume. 


Theorem 4.3 ([48, Theorem 3.2]) Suppose ® satisfies several conditions.” Sup- 
pose that there exists a constant C such that for all I € D 


lw nrllullowy < C. (15) 


?The conditions on the function ® are satisfied by the functions ®(L) = Llog'T? L and 
Llog Lloglog!*° L (for sufficiently large 0 > 0), but not by ®(L) = Llog L. 
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Then any Calderén-Zygmund singular integral operator T is weakly bounded from 
L?(u) into L?-~(v), i.e., 


Clif liz \? 
ver eR: FO) =A} < (See) | 16) 
Let us assume that u and v are such that 


lu" eallvlizogzs < C, 


which is a weaker condition than the condition in Theorem 4.3. Then by Theo- 
rem 2.8 we have that, for every J € D, 


1 |A;v|? C 
Illacgus * Ty ee IE ca (17) 
red) MN 


The condition we have for the paraproduct is 


1 


i S > [Arv?m;(u7!)[7| < Cmj(v) (18) 


IED(J) 


Note that if (u,v) € A$ we have that 


1 1 |A;v|? 
— Ajv|?m (ua!) < [u,v] ga T 
Gy oe Fem < Lae DO Id 
TED(J) IeDVJ) 
; [u,v], . 
while myv < te Therefore we cannot compare bump conditions to the 


conditions in our results without the additional assumption that there is a constant 
q > O such that mj(u~!)myv > q for all J € D. If. g < mj(u~!)myv < @ for all 
J € D; the two conditions (17) and (18) become equivalent, but this assumption 
essentially reduces the problem to the one weight case [44, Proposition 7.4]. 


BMO vs C,,, 


Formally the dyadic paraproduct is a bilinear operator for the locally integrable 
functions b and f. After we fix b in BMO“, we consider 7, as a linear operator acting 
on f. In the following proposition, we try to answer the question: if , is bounded on 
(weighted) Lebesgue spaces, then in which space does the locally square integrable 
function b lie? 
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Proposition 4.4 (A necessary condition for boundedness of m,) Let u and v be 
weights and, for 1 < p < 00, b € L7,.(R). Assume mp : L?(u) > L?(v) is a bounded 
operator then there is a constant C, > 0 such that for anyl € D, 


i |b(x) — mb v(x)dx < Cpu(f), (19) 
I 
where I is the dyadic parent of I. The constant cP is the operator norm 


Il tp ler wz) 


Proof Let us choose f = h,; for some dyadic interval J. Then, by assumption, there 


exists a constant C, = IZslow +e) such that 
|r (hy) (x) /Pu(x)dx < Cy J [hy (x) /Pu(a)dx = Cp, ——,. (20) 
R R JP! 
On the other hand, 
my(hy)(x) =) mj(hy){b, hy)hy(x) 
IeD 
= > — i hy)hy(x) — Yo = ( hy) hy (x) 
IeD(J+) IeD(J-) 


om = my, b) Ly, (8) = (b@) = my_b)1y_09)] 


Wi 


where the last equality is due to the fact that (b — myb)1; = rep (d, hi) hi. 
Therefore we can write 


1 
[ |p (hj) (x) /Pu(aydx = pe p |(b(x) — mz, b) 1, (&) — (b@) — my_b)1y_(@)| v(x) dx 


: Dp 
= pe (/. |b(x) — my, bP v(x)dx + [ |b(x) — my oco) ; 


Thus we can conclude that there is a constant C, such that for all J ¢ D 


[ve — myb|Pu(x)dx < C,u(1). 
I oO 

The condition (19) can be considered as a testing condition for the bound- 
edness of the dyadic paraproduct from L?(u) into L?(v). When u,v € Ae both 
weights are doubling weights, in particular u(1) < D(u)u(J) (where D(u) := 
supjep UL) /u() < oo is the dyadic doubling constant of wu). In this case, (19) 
becomes 
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[ive — myb\P v(x) dx < Chu() 
I 


which is equivalent to the boundedness of the paraproduct and its adjoint [20, 
Theorem 4.1] from L?(u) into L?(v) when u,v € A When u = v this necessary 
condition was known in the more general matrix A, context [28]. 

One can immediately conclude that the inequality (19) implies that b is in BMO“ 
for u = v = 1 (Lebesgue space). Thus, one can view the condition b € BMO4 as 
a testing condition for the boundedness of the paraproduct on L?(R), in the same 
way that the conditions T1,7*1 € BMO in the celebrated T1 Theorem are testing 
conditions. 

For the weighted Lebesgue space, we have the following corollary. 


Corollary 4.5 For 1 < p < o,b € L?.(R), if m is bounded from L?(v) into 


loc 
itself and v is an AG weight, then b belongs to BMO*%. Moreover, ||b||pyoa < 


1 
2llallareyur lo] 


Proof For any I € D, we have 


[pv — my,b| dx = [ioe - myb|v> (x)v~? (x) dx 
I I 


b=) ble Giaee) 
<(f (x) — myb|? v(x) ) (f (x) x) 
< cplr( [oe ax)’ ( [oro ax)" (21) 
I I 
1.1 5 1 1 - 
=? |i\( — ie) | Lee 22 
rla(= [ves :) Ge fe () :) (22) 


1 
< 2[lzollowovelrlaglM- 


Ja 


Here the inequality (21) holds due to (19) with v = u. oO 


Notice that b € BMO implies that b € L7.. 
Nirenberg inequality. 

For the two weight case, in order to show that (22) is bounded, we need (v, u) € 
A, which is totally different from (u,v) € A,. Thus, we cannot conclude anything 
more than (19) for the two weight situation. 

To finish this section, we give a relation between BMO? and Carl, y. 


Corollary 4.6 If v € Ag then 


(R) for all 1 < p < oo by the John- 


BMO® = Carly, 0 Lj,(R). 
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Proof In section “Carleson Sequences”, we observed that BMO4 C Carl,,, for any 
weight v such that v—! is also a weight. Also recall that by the John-Nirenberg 
theorem if b € BMO then b € Lj, .(R). Thus, to complete the proof, we need to 
show that if v € Ag and b ¢€ Can, » ML7(R) then b € BMO‘. If v € Ag then in 
particular v € RH? By Theorem 2.8, it follows that, for every dyadic interval J, we 
have 


A 
Te (emo < Ble DO | Wl Ul < Cholegllangmse . (3) 


MV 
IED(J) TED(J) 


Since v € Ag and b € Carl, .,, all conditions of Theorem 1.1 are satisfied, and we 
know that the dyadic paraproduct, zp, is bounded from L?(v) into L?(v). Thus, by 
Corollary 4.5, b must belong to BMO%. oO 


Similar one weight results are shown by Isralowitz, Kwon, and Pott [28] in the much 
more general matrix A, context. 


Carl,,, vs Bloom’s BMO 


There are other weighted bounded mean oscillation spaces in the literature. The 
dyadic weighted BMO space for a weight yz in R“, denoted BMO“(j1) in [20, 
Section 2.6], consists of all locally integrable functions b such that 


1 
llb|laao4(u) = Sup — | |b(x) — mgb| dx < 00, 
BMO*%(1) 0 u(Q) 0 Q 


where the supremum is taken over all dyadic cubes with sides parallel to the axes. 
In that paper, it is pointed out that when the weight is in A. (hence, in particular, 
is a doubling weight), one can replace the L' with L? norm provided the integration 


with respect to the Lebesgue measure is replaced by o dx where 0 = yt is the 
conjugate weight. 

When u,v € Ag, let w := u'/*v~!/?, the corresponding weighted BMO“(j1) is 
Bloom’s BMO [5]. In [20, Theorem 4.1] it is shown that the following are equivalent 
conditions. 


(i) b € BMO4 (pL). 
(li) be es meaning ae Pp / |b(x) — myb|? | (x) dx < 0. 
(iii) sup —~ = [100 — myb|*v(x) dx < 00. 
red UCL) Jy 


(iv) Sup a [ioc — myb\|?u7"(x) dx < 00. 


(v) ae is vou from L?(u) into L?(v). 
(vi) af from L?(u) into L7(v). 
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Theorem 4.7 Assume u,v € Ag and let w = u'/?v—'/?, Then b € Carly, if and 
only if b € Carl,-1 ,-1 if and only if b € BMO4%(u~'). 


Proof First we will show that Carl, U Carl,—1,-1 C BMO“(u~'). Assume b € 
Carl,,, U Carl,-1 ,-1. By assumption there is C > 0 such that for all J ¢ D 


(a) yt <cu'(/), or () YO 


red IEDVJ) 


< Cv(J). 
mu 


When w € A the dyadic square function S“ obeys an inverse estimate |[/'|| Dw) S 


Cl] IS“Flli260y: In case (a), since v-! € A4 we can use the inverse estimate for 


S¢ in BP!) and get, for all J € D, the estimate 


I — my) Uylijay—ty) S Clu] gall S*((b — myb) Ly) M721) 


— 2 —1 
= Clu] D> bpm 


1eD(J) 


< Cluliy™'W) 


1 
Hence we conclude that sup —— [ive — myb\?v—! (x) dx < oo. 
reD U'(L) Jy 


Similarly if we assume (b), we will conclude sup / |b(x) — mb? u(x) 


dx < oo, using this time that u € Ag . These integral conditions are each separately 
equivalent to b € BMO(1~!) when u, v € Ag by the results in [20, Theorem 4.1]. 

Assume now that b € BMO(u!) and u,v € Ag. We will show that b € Carl, 
Carl,-1 ,-1. The assumption implies that 


I — mb) Li llpq—ry < Cu TY) and || —myb)1yl2q) < CoV). 


ws 
Both u,v € A$ so are u-!, v! € AY, also 1 < m;v mv, and the dyadic square 
function is bonnidéd in L?(w) for w € A¥, moreover ||S“((b — b,)1,)|\? 


rep |br\?miw. We therefore conclude that 


Rw) 


|br|* 

DS DLE [iP mv! < Clv]all(b — msb)Llljaq—1 S Cw), 
mv 

1ED\J) IlED(J) 


|b7|? 
De aS DE lor? < Clulgg I — rsd) Lillia S CoM). 
IED(J) IED(J) 


On Two Weight Estimates for Dyadic Operators 159 


Hence b € Carl, O Carl,-1 ,-1. 

All together we have shown Carl, U Carl,-1,-1 C BMO(u™!) © Carl, 9 
Carl,—:,-1 which implies that Carl,, = BMO(u~') = Carl,-1,-1 when 
u,v € AS. oO 


We just showed that when u,v € Ag and the dyadic paraproduct x, is bounded 
from L?(u) into L?(v) then b € Carl, ,,. Compare to Corollary 4.9 where only v € Af 
and the pair (u, v) is in joint A>, but we assume b € Car',,,, (note that the roles of u 
and v have been interchanged, and in general Carl,,,, 4 Carl,,,). 

When we assume only v € Ag then zr, is bounded from L?(u) into L?(v) iff (11), 
that is 


By(u, v) := PH Pe 7m, (u_!)?myv < oo. 


Lemma 4.8 /f (u,v) € AG and b € Carl,,, with intensity B,,, then By(u,v) < 00. 
Moreover 


Ba(u,v) < [Py Bu 


Proof The result follows immediately using first the joint A, condition and then the 
Carl,,, condition, 


b? = 
> b3 sm (uo Ymyv < [u, ag a a < [u, Vg Buvt ‘WD. 


JED(I) JED() 


This implies Bz (u, v) < [u, VP Buy < 00 as required. Oo 
2 


Using the results in [21] we will conclude that 


Corollary 4.9 [f (u,v) € Ative Ag, and b € Carly, then my is bounded from 
L?(u) into L?(v). 


As observed in [44] if (u,v) € AS, v € AY (or u € A%), and b € BMO4 
then the boundedness of the paraproduct reduces to one weight boundedness 
on L7(v) (or on L?(u)). The observation being that joint A, implies, by the 
Lebesgue Differentiation Theorem, that v(x) < [u, v] Agu(x) for a.e. x, and therefore 


Igllz2w) < lu vy jligll2q: If v € Ad then by Beznosova’s one weight linear bound 


for the pamapraduc! in L?(v) [1] one has 


I zaf lize) < Cihlamotvlaa lf lle) < CBlamo« Ll ag le, v7 If lew 
2 AS 
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Likewise if u € AS, then 


1/2 1/2 
ref ize) < [ v) elloref zy S CBlamorllagle. va lflli2uy: 


where we used Beznosova’s result in the last inequality. Using this observation we 
can deduce Corollary 4.9 without using the machinery of [21] if we can prove that 
(u,v) € Ad, v € AY, and b € Carl,,, imply b € BMO‘. 


Lemma 4.10 [f (u,v) € A$, v € AY, and b € Carly OL; 


loc 


(R) then b € BMO‘. 


Proof Suffices to show that b € Carl, ,. Notice sae the Cauchy-Schwarz inequality 


vd ad 
and the joint A, condition imply =i w= <v(VJ) < Tay 7 , therefore 
1 be [wud ag b? 
< Bu, ve 
v(J) p> mv — u-!(J) 2 mv <u, v]ag 
IED(J) 1ED(J) 
We conclude that b € Carl, 0 L7,.(IR) = BMO% by Corollary 4.6. Oo 


It may be worth to point out that when u = v € Ap, the condition B,(v, v) < co 
coincides with b € Carl, ,. The reason being that now we do have the lower bound 
as well as the upper bound 1 < m;v m;(v~!) < [v],,- 


Lemma 4.11 if w € A» then b € Carly y if and only if B2(w) < 00, where 


Bo(w) := Bo(w,w) = sup iG Ta Y= mi(w') |b? mw. 
IED(J) 


The Maximal and the Square Functions 


In this section we relate the boundedness of the square function with the bounded- 
ness of the Maximal function from L?(u) into L?(v). The main result of this section 
states that if the weight v is in RH' : and the Maximal function is bounded then the 
square function is also bounded. This result is an adaptation of Buckley’s proof [6], 
for the fact that if w € Ag then S¢ is bounded on L?(w). The last author proved a 
similar result, in [52], for the weighted maximal function and the weighted square 
function in L7(R) and 1 < q < ow. 


Theorem 5.1 Let (u,v) be a pair of weights such that v € RH@ and the Maximal 
function M is bounded from L?(u) into L?(v) with bound M,,y then there exist C > 0 
such that 


2 
Sf lize) < CMa (1 + [vo] IF ll2an- 


RH 
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As an immediate Corollary of Theorem 5.1 and Theorem 3.4 we get, 


Corollary 5.2. Assume (u,v) € A%, u! € RH%, and v € RH4, then 


= 1/2 
IS“Flli2q S C(le PL glen) 0 + PAI lacy: 


Note that this estimate does not recover the linear estimate in the one weight 
case u = v € Ag, it is off by a factor of the form Gree unlike the estimate we will 
present in Theorem 6.1. 


Proof (Proof of Theorem 5.1) 
Given real-valued f € L?(u) we have 


1 A 
SFllia) = DMF ad Pry = 5D buf — mgf Po) 


IED IED 


1 A 
=5 Do (mif = mif)v := Ea. 


IED 


Adding and subtracting 2v(/)m7f in the sum and rearranging 


D1 = D> (Qv(Dmif — vDm5f) + D> (vo — 20) mif =: D2 + Bs. 


IED IED 


Therefore, it is enough to check that for all f € L?(u): 


|=;| < CM? 


uv 


C+ D]pia Uflig) for §= 2,3. 


Estimating ©: First, let dm := Yojep, 2v(Dmjf = 2 f(Enf(x))’v(x)dx where 
Enf (x) = myf for x € I € D,, and D,, is the collection of all dyadic intervals with 
length 2~”. Then 


oe) 


X= ye (2u(Dmif — v(Dm;f) = (Ain = Am—1)- 


IED m=—oo 


Using the fact that E,,f(x) < Mf(x) for all x € R we can bound each a,, by 
lan| a 2f |Mf (x)? v(x)dx = 2IIMF lla. = CMiy Mf llz2qy: 


The last inequality follows since M is assumed to be bounded from L?(u) 
to L7(v). Let s, i= lal <n(4m — 4m—1), the partial sum sequence of &. Since 
this is a telescoping sum we have s, = (d, — a—y-1) for all n € N. Therefore 
lsn| < 2CMi. sIlf lz. (wy forall n € N which leads us to the better than desired 


estimate 


| X| = CMa liza: 
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Estimating &3: Since every interval has two children, switching the sum over / to 
a sum over the parents J = J we have the following cancellation, 


Y= (vo@ — 20) mef = D> (vV) — 204) + vV) — 20) mif = 0. 


IED JED 


Hence we can write 


D3 = D> (v() — 2v(L)) (mpf — més). 


IED 
Applying the Cauchy-Schwarz inequality, 
1/2 1/2 
[23] < (x WOE) 4, mif + mf) 3) (Xedvemr— mp) 
IED 


1€D uJ) 
= 22/04) < 2-7 (E44 D1), 


(od) md) 


where D4 := Yep (nyf + mf)”. Thus, 


<|Eo| + [Es] < CM? yMlfliizqy + 2/724 + Dh). 


Subtracting 2~!/?, from both sides of this inequality and multiplying by (2+ /2) 
give us 


XS Milf llizqy) + U4- (24) 
Estimating 4: Note that m;|f| < 2m;|f|, hence (mf +m;f)? < (3m;|f|)*, and also 


note that |v(Z) —2v()| = 2\7|7! |A;u|. Switching the sum over J to a sum over the 
parents J = D gives 


Las \> —— | ant | mjlf|- 


JED 
Thus 
| Ajv vey nclfl < |A; 2 
Eas Dlg ltl SD) ——— [int MF (x) 
IED IED 


Ss [Ul en tes = Man IMFllioe = Mle Mi vllf lira: 
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Note that in the third inequality we use the fact that if v € RH? then, by 
Theorem 2.8, {|A;v|?|/|/myv}iep is a v-Carleson sequence with intensity [U]ane- 
This estimate together with (24) give us the desired estimate for real-valued 
functions. Using this estimate for the real and complex parts of f € L?(v) we will 
conclude that 


2 
ISFlli2q) S CMO + Play Mf lize: 4 


Even though not explicitly we are still assuming that (u,v) € A‘, since we 
assumed that M : L?(u) — L?(v) which implies (u, v) € A%, see [16]. 


Remark 5.3 In the last theorem we are providing a connection between the 
boundedness of the square function and the boundedness of the Maximal function. 
Another novelty of this result is that we have an estimate on how the norm of the 
square function depends on [v] RH? and the norm of the Maximal function. 


As a consequence of Theorems 5.1 and 3.6, we can show that the boundeness 
of the Maximal function from L?(u) into L*(v) together with the assumption that 
ve RH¢ will imply the boundedness of the martingale transform. 


Theorem 5.4 Let (u,v) be a pair of weights such that v € RH@ and the Maximal 
function M is bounded from L?(u) into L?(v) then the martingale transforms T,, are 
uniformly bounded from L?(u) into L?(v). 


Proof Let us consider a pair of weights (u,v) satisfying the assumptions. By 
Theorem 5.1 the dyadic square function is bounded, and by Theorem 3.8 the pair of 
weights (u, v) satisfies 


(i) (u,v) € Ay 
(ii) {{A;u7!|?myv|I|};ep is a u~!-Carleson sequence. 


Let us denote the intensity of the u~!-Carleson sequence in (ii) by D,,,. To prove 
the boundedness of the martingale transform 7,, we need to show that (u, v) also 
satisfies the last two conditions in Theorem 3.6 


(iii) {|A;v/?m;(u7!)|Z|}zep is a v-Carleson sequence. 
(iv) The operator Tp is bounded from L7(u) into L7(v). 


For condition (iii), we use the assumption v € RH“, Theorem 2.8(b), and 
(u,v) € A§. More precisely, for any J € D, 


II 


Ayv|? 
YA Pm ul] = >> OE me Oil 
MU 


IED IeD(J) 


|Arvl? 
[us vLag DY) Ul S Clee Pag lle?) 
edu) 


lA 
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We now need to check condition (iv), which for any positive f € L?(u~!) and 
g € L’(v) is equivalent to 


(Zofu'), gv) | < Clif llzu—y lalla - 


Thus, it suffices to verify the estimate 


Sm fle!) (gly d 


= Wl < Cifllza-yllallza)- (25) 
IED my(u~") 


To see that (25) holds, we use the Cauchy-Schwarz inequality: 


Soom (flu rm go) ty 


=] 
IED m(u~') 


iy sD 1/2 1/2 

= (x ee 2) [Arum (x (mis) Loe el rin) 
IED (u~") 1€D may 

< yoiA u!|?myv|I| inf M?_,f (x) 7 a ~ i || inf M, (x) ia 

= I I cera! v8 : 


IED IED 


2 
Since | A;u~!|?m,v|/| is a u!-Carleson sequence with intensity D,,, and [Ani || 
is a v-Carleson sequence with intensity [v] RH? by condition (11) and Theorem 2.8(b) 


respectively, we have that 


= | aa a ‘| 
do mi ((f lun (\g|v) — IS /Duvlvlanll Mall 20 Moslli2e) 
my (u-") ; 
IED 
<8 Duv lay Fll2w-y lls ll2@) , 
the last inequality by Theorem 3.2. oO 


As an immediate consequence of the proof of Theorem 5.4 and Corollary 5.2 we 
get the following corollary. 


Corollary 5.5 If (u,v) € A4, uu! e RH? and v € RH! then the martingale 
transforms T, are uniformly bounded L?(u) into L?(v). 


The Sharp Quantitative Estimate for the Dyadic Square 
Function 


Our last theorem provides the dependence of the operator norm ||S*|| PWw—>2(v) 
on the joint A> characteristic of the weights and [u7'] RH¢ This extends results of 
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Beznosova [2], and we follow the template of her original proof. We could have 
used instead Proposition 2.1 as remarked by our careful referee, we leave to the 
interested reader to verify that this can indeed be done. 


Theorem 6.1 Let (u, v) be a pair of weights such that (u,v) € A$ and u~! € RH¢. 
Then there is a constant such that 


— 2 
IS“Iaey>20) S Cl a1 + "pea: 


Proof We can write the square of the norm ||S“/|| L(y) as: 


IS“Fll2>c _ py \(f, hy) |? 17(x) v(x)dx = > \(f, Ar) |?myv. 
IED 


HI 1D 


We decompose /, in a slightly different way. For any weight u—', we can write 
hy as 


Ayu! 


an ~ Bim) 


Hi '(«) +Af '1,0)) where At 


1 
amt 


The family {u~!/ 2H ep is orthogonal in L?(dx) with norms satisfying the 
inequality 


= 1 
PHT lIa2@ < V\elmi(u"). 


Hence by Bessel’s inequality we have that for all f € L?(u) (recall that f € L?(u) if 
and only if fu'/? € L?(R)), 


l(f, HY 
yo < I ln = Williz: 


Hiei We = 
2 imu ; 


Since mv < [u, v] ga/mm (ue) we conclude that for all f € L?(u), 


—1 


H# 2 
lr. a] my < [u,v] ll lq (26) 
IED Vv 
We claim that 
AY 1 \/? 7 
Tle AD] mee Clee ge en 
1eD 
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Using estimates (26) and (27) and the Cauchy-Schwarz inequality we con- 
clude that 


> l(f. hr) |? mv < C([u, v] ad + 2Iu, v]42 Cama ee + [u, V] 2 le Tene) lf llz2 9 ; 
1eD 


which completes the proof. 
Let us return to our claim. The left hand side of (27) can be written as 


|e ee) mee = 5 mer (SA) 
; mv = = MmyDv. 
ieD I Vip iG) 
By our assumptions: (u,v) € ae andu—! € RH¢, for any J € D, we have 


1 Ane? \7 [u,v] 4a Agu! \? 
lJ = (—<) [I\m;(u~')myv < Tie > (—<) \|m;(u-!) 


IED(J) I€D(J) 


< [u,v] ag [uv Tengms(u") : (28) 


The last inequality (28) is an application of Lemma 2.8(b). Therefore the 
Ayua! 
my(u—!) 
intensity [u, v]ag [went 
We now can prove the claimed estimate (27), 


’ |mif| : Ayu! : ap =i 
Sime (sry) Mme = (rary) (tary) Pie 
IED IED 
uo! 2 
=D (min) (SS) teint tym 


IED 


sequence {a; := ( )°[|n2 (u! mv }1ed is a u_!-Carleson sequence with 


< inf M, 2 inn( . i [Z|m7(u~!)myv. 


ip iu") 


Finally using Lemma 2.5 with F(x) = MM, (fu)(x) and the u~'-Carleson sequence 
{a}rep with intensity [u, v] ,4 [wen , will give us that 


In? (oe -) Hm = Che vale Ma1 (0) Boge 


=1 
IED -_ (u ) 
< Clu, v) ad [uw "Teed all e2q—1y 


= Clu, v] gale Tene faq: 
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Analyzing carefully the proof above we realize that if instead of assuming u~! € 


RH¢ we assume that {|A;w~!|?mv}rep is a u~!-Carleson sequence with intensity 
C,,» the argument will go through and we will recover the Lacey-Li estimate. 


Theorem 6.2 Let (u,v) be a pair of weights such that (u,v) € Ag and 
{\Aju7!|?myvlep is a u!-Carleson sequence with intensity Cy». Then there is 
a constant C > 0 such that 


1/2 
IS Ie@—22@) < C([, v) ag + Cin) 


We leave the details of the proof to the reader. 
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Potential Operators with Mixed Homogeneity 


Calixto P. Calderén and Wilfredo Urbina 


Dedicated to the memory of Cora Sadosky 


Abstract In 1966 Cora Sadosky introduced a number of results in a remarkable 
paper “A note on Parabolic Fractional and Singular Integrals”, see Sadosky (Studia 
Math 26:295-—302, 1966), in particular, a quasi homogeneous version of Sobolev’s 
immersion theorem was discussed in the paper. Later, C. P. Calderon and T. 
Kwembe, following those ideas and incorporating the context of Fabes-Riviere 
homogeneity (Fabes and Riviere, Studia Math 27:19-38, 1966), proved a similar 
results for potential operators with kernels having mixed homogeneity. Calderén- 
Kwembe’s (Dispersal models. X Latin American School of Mathematics (Tanti, 
1991). Rev Un Mat Argent 37(3-4):212—229, 1991/1992) basic theorem was very 
much in the spirit of Sadosky’s result. The natural extension of Sadosky’s paper is 
nevertheless the joint paper by C. Sadosky and M. Cotlar (On quasi-homogeneous 
Bessel potential operators. In: Singular integrals. Proceedings of symposia in pure 
mathematics, Chicago, 1966. American Mathematical Society, Providence, 1967, 
pp 275-287) which constitutes a true tour de force through, what is now considered, 
local properties of solutions of parabolic partial differential equations. The tools are 
the introduction of “Parabolic Bessel Potentials” combined with mixed homogeneity 
local smoothness estimates. 

The aim of this paper is to extend Calder6n-Kwembe’s theorem in two directions: 
(a) establish a corresponding result in terms of mixed norms in the Benedek- 
Panzone’s sense, see Benedek and Panzone (Duke Math J 28:3-21, 1961). (b) 
establish results for the case of unbounded characteristics (integrable to the power 
r on the unit sphere). Calderén-Kwembe’s theorem can also be estated in the frame 
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Introduction 


In 1966 Cora Sadosky introduced a number of results in a remarkable paper “A 
note on Parabolic Fractional and Singular Integrals”, see [8], in particular, a quasi 
homogeneous version of Sobolev’s immersion theorem was discussed in the paper. 
Later C. P. Calderén and T. Kwembe, following those ideas and incorporating the 
context of Fabes-Riviere homogeneity [5], proved a similar results for potential 
operators with kernels having mixed homogeneity. Calderén-Kwembe’s basic 
theorem was very much in the spirit of Sadosky’s result. The natural extension 
of Sadosky’s paper is nevertheless the joint paper by C. Sadosky and M. Cotlar 
[9] which constitutes a true tour de force through, what is now considered, local 
properties of solutions of parabolic partial differential equations. The tools are the 
introduction of “Parabolic Bessel Potentials” combined with mixed homogeneity 
local smoothness estimates. 

The aim of this paper is to extend Calder6n-Kwembe’s theorem in two directions: 
(a) establish a corresponding result in terms of mixed norms in the Benedek- 
Panzone’s sense, see [1]. (b) establish results for the case of unbounded charac- 
teristics (integrable to the power r on the unit sphere). Calder6n-Kwembe’s theorem 
can also be estated in the frame of generalized homogeneity but that case will not 
be consider here, see N. Riviere [7] and A.P. Calderén and A. Torchinsky [2]. 

In what follows C is a positive real constant that may change from line to line. 


A Change of Variable of Polar Type 


Let aj,-+-+ , dy (fixed) real numbers, a; > 1 and let x = (x1,--+ ,X%) € IR", consider 
the function 


n 


x 
F(x, p) = == (1) 


2a;? 
= p J 


for a fixed x. F(x, p) is a continuous decreasing function of p (e > 0) and therefore 
there is a unique solution of F(x, e) = 1 that will be denoted as p(x). It defines a 
distance in R”, see [9] which is called parabolic distance, 


p(x, y) = p(x—y). 
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Then, we have, 


x) Xn 


aw pe) 


yet, =f eR’: |y| = 1}. 


The point x can be written (change of variables of polar type) as 


n—1 


x, = p"' cos dg; cos h2-++ cos gn—-1 = p™! | [cos go; = p'V(¢) 


i=1 
n—1 


xX. = p™ sin gd cos d2--- cos dy») = p™ sin gy I] cos d = p’ V2(d) 


i=2 
n—1 


x3 = p' sin gy cos 3+ cos ,-1 = p® sin ds | [ cos; = p? V3(¢) 
i=3 


Xn—-1 = p™" sin bp—2 COS bp—1 = P*"—'! Vn_1(P) 


i= p™ sin Pn—-1 = p™ W,(¢). 


From the construction, it follows 


YU? (G) = 1. 
i=1 
Then, the element of volume is 
dx = p!""'I(p,+++ .n—1)dpdo, (2) 
where a = (d),--- ,d,) and |a| = a; +--- +p, see [5]. 


Notice that |x;| < p% and therefore p > |x;|!/“'. On the other hand, since a; > 1, 


we have 


|x;|!/% = p| sin dj] cos d; +++ cos bn—1|/“ 


> p|sin gj; cos g;--- cos by—-1|. 


This will be used below. 
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We shall use in this paper the following distance, 


n 


B(x) = DO bail! 


i=1 


The fact that p is actually a distance, follows immediately. The triangle inequality 
is obtained since a; > 1 and hence 4 < 1, consequently, 


Dx+y) = doit yl! sD lal! + > bl! = p@) + 00). 
i=1 i=1 i=1 
From the fact that R” is a finite dimensional vector space, we know, that p and 
p are equivalent. For the sake of completeness, the equivalence will be given here 
explicitly, 


<1, ie. p% > |xil, 
p% 
then p > |x;|!/', and therefore, p > + )7y_, |xi|!/. 
On the other hand, 
lx! = pl¥i(G)|"/" = plVi(P)I, 
hence, 


n 
2/aj 2 
) |x; fay 2p, 
i=1 
and therefore, 
n 
1/a; 
ps 5 [x;|'/4, 
i=1 


An equivalent metric was also considered in C. Sadosky [8] and C. Sadosky & 
M. Cotlar [9] , when aj,--- , a, are rational numbers, 


n 
r(x) = o> |x;|"/a0)/™ 
i=1 


where m is the smallest integer that is divisible by 2a;, i= 1,2,--- ,n. 
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Additionally, P. Krée [6] considered another equivalent distance, 


F(x) = max |x;|!/%. 
1l<i<n 
Main Results 
Given an operator 
tre) = [Ko vif. 3) 
R" 


We want to study the conditions on the kernel K such that it makes the operator T 
to be L’ — L’ bounded for suitable values of p and q. 


Potential Operators of Mixed Homogeneity 


Given kernel K, we say it is a potential kernel with mixed homogeneity if 
KA" x1, AP x2, AMX) = AB-ll K(x), (4) 


for 0 < B < jal = YL, aj, a; > 1. 
If x is expressed in polar type coordinates, x = p“W(@), as above, then 


K(x1,X2.++* Xn) = ph IK). (5) 
For the case when K can be written as 


QP) 


K(x1,X2,°+* .Xy) = pel-B’ 


with || 2 || z29(6,) < M, (6) 


C. P. Calderén and T. Kwembe [3], proved that 


II <Clflp. for +=+-4. (7) 
pal 


As we have already said, the aim of this paper is to extend Calder6én-Kwembe’s 
theorem in two directions: 


(i) Establish a corresponding result in terms of mixed norms in the Benedek- 
Panzone’s sense, [1]. 
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(ii) Establish results for the case of unbounded characteristics (integrable to the 
power r on the unit sphere). 


As it was mentioned in the introduction, Calder6n-Kwembe’s theorem can also be 
extended to potential operators with generalized homogeneity, but that case will not 
be consider here, see N. Riviere [7] and A.P. Calderén and A. Torchinsky [2]. 


Estimates of Benedek-Panzone [1], Du Plessis [4] 


If the kernel K satisfies ||K(@) |loo < M, then 
[K(1.x2°+ andl S CoP IK) Ihoo, (8) 
for )°;_, Bj = B, Bj = %a;, 0 < 6; < 1. Then, one has the estimate, 


1 1 
Coen bad aye TTL Oy baal) -F 


n 1 n 1 

< — eee 

= I] |x; |1/4i(a-B) II |x;|-%) ° 
j=l ‘J df 


j=l 


For each j we have a regular (one) dimensional potential operator, that maps L”’ into 
L% with — = > — 6). 


J J 
We will denote Lp, p,,-..,», the space of measurable functions in R”, such that 
co co co 
lL llesecos esas = (f (f of If (x1. xX2.0 Xp) [P" cacy )P2/P! dacy P3/P2 ote Pal Pn! dye, )1/Pn <0, 
—oo —oo —oo 


see Benedek-Panzone [1], page 301. 
Theorem 2.1 The operator defined as Tf = K * f, where K satisfies (4); maps 
continuously Lp, py. ,p, ito Lg) gy. q,. With qi given by i = — 6, and B = 


=i 94. 


Proof It will suffice to show it for the case n = 2, the general case follows by 
induction. Let f € Ly, », and g € Lg* gt q; being the dual exponents of g;. By the 
potential theorem in dimension one applied twice, 


in 


/ / Moni eo ya| Patel 1 2)8O1, Ya)aridrady dy2 
—oo J—oo 
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1 ee 1 
-| es ( | (1,2) 8(01.y2)dxidyi )dvodys 
2 - yi |'- 


00 [2 — Y2|~% \ Jaco [a1 = 


a 1 
= [seal dln ladles 


oo |X2 — yo]! 
= Ilo poll8llg ar > 
using the duality in space with mixed L?-norms, we get the thesis, see Benedek- 
Panzone [1], page 303. oO 


It should be noted that the above proof follows closely the corresponding one in 
Benedek-Panzone [1], pages 321-322. 


Potential Operators with Not Bounded Characteristic 


Consider kernels of the form 
K(x1,%2,-++ 3m) = p *FK(Q), (9) 


with |a] = a) +--+ a,3 a; => 1,50 < B < |a|. K(@) is considered no longer 


bounded on © = {x : p(x) = 1} but instead we will assume that on © we have 


[ |K(@)|"do < 00, (10) 


for some r; | < r < oo. Here do stands for the element of “surface” on the unit 
sphere of R”. 


Theorem 2.2 Let the kernel K be such that (10) holds with 0 < 1 + é < Lif 
f € T7(R") and + = - & then calling T(f) = (K * f) we have 
(i) 


Cc : 
{x : |ZF(@)| > AS] < gL wonraoiy, (11) 


here V(a) = K(@). 


(ii) For values of p = 1 such that ; sae. 


EF > + we have 
Jal rO 


|K * fll, < Clif lp. 


for + =1_ 8 whenever Js |K@)|""do < 00. 


Pp ial’ 


Proof (ii) follows from (i) by application of Marcinkiewicz interpolation theorem. 


178 C.P. Calderén and W. Urbina 


The part (i) of the thesis will follow after few steps. We decompose the kernel 
K = K, + Ko, where K> = K if x € A where, 


= (= (0.6): pe) > CMH, 


(for an appropriated value of 6 to be determune later) and Ky = 0 otherwise; and 


K, = K — kK». For this selection 6 = 
lal—Bp =e 


Estimates for K * f. Let p > 1 and q its conjugate exponent; ; + - = 1. We will 
use the point estimate, 


IK «fs Cf IRal®ds)"- If (12) 

We will consider ||f||, = 1 and move to the evaluation of (/ |K2|?dx)!/4. Using 
Fubini’s theorem, we set 

W(a)|4d en 7 pilg) 13 

( Os fovea Xs plle=BaP °) , aa 


By using the value of 5 from above we have 


8[—(lal — B)q + lal] = —4. 


Thus the inner integral above is 


CO 
1 
_-_plal-1 = q —q 
in a=paP ap = CAG) (14) 


Xa 


then, (13) is immediately seen to be equal to 
ont [ [W|7|W|~%do, (15) 
z 
taking now the g root, we get 


|Ko *f| < CA. 


We pass now to the estimate of |K, * f|. Using Young’s inequality for the 
convolution, we obtain 


tx: |Ki * FIG) > ANS 5 UK IP. (16) 
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Using “polar” coordinates and Fubini’s theorem, we get for (16) 


B@) BO). s(la\—Cal— 
cf ae (17) 
pO Xr 
Recall that for this case 6 = ose? so the exponent for the integrand above is 
1, PB la 


lal—pB |al—pB 


Thus, the above integral is dominated by 
C | Cf [ea)|!el/e-B) doy (18) 
Avlal/(al-PB) * Js . 
Using Jensen’s inequality for the integral above and remembering that 


1 1 B _ lal—pb 


rp |al pial 


’ 


we get 
1 ; Beate 
co6f aylaoy = cr IvIy. (19) 
wu x ‘Av 


From the constructions above and keeping in mind that ||f||, = 1, we have for a 
large but fixed C, 


{x1 |K *« f|(x) > CA} C (x: |Ki * f|(x) > CA/2} U {x : |Ko * f|(x) > CA/2} 
With the appropriate selection of C we get 
{x Ky * f(x) > CA/2} = @. 
On the other hand, 


C 
es 


ix: IK #fl@) > BS 


[| WII; (20) 


An homogeneity argument gives (i) for the case p > 1. 


The limit case p = 1. Consider as above the decomposition K = K, + K». In this 
case, 
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the choice of 6 is 


1 
i 
la| — B 


Consider now f(x — y) = f(y). A pointwise estimate gives, 


= We & 
[Ko *fl < | |W(@)|do —f(p. ¢)p'"""'dp. 
3 fo)>(4@8 plal-P 


The above expression does not exceed 
WA). saa eS ae 
[i vteido SMe [ Fep.pyplttap < cA. 
Evaluation of |K, * f|. Remember that ||f||; = 1, using Young’s Theorem 
Cc Cc 
ix: i *¥ AG) > AM Ss TAMA = sli lh. (21) 


The evaluation of + || Ky ||, is obtained using Fubini’s theorem 


(258 


C a 1 1 
lal-1 7) 1+f65 
7 [ vous f pl=BP dp = 4 1+Bs [vw do 


Cc r 
= 7 Lol do. 


The proof now follows from the pattern from the previous case. Oo 
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David Cruz-Uribe, OFS 


Dedicated to the memory of Professor Cora Sadosky 


Abstract We give new and elementary proofs of one weight norm inequalities 
for fractional integral operators and commutators. Our proofs are based on the 
machinery of dyadic grids and sparse operators used in the proof of the A, 
conjecture. 
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Introduction 


The fractional integral operators, also called the Riesz potentials, are the convolution 
operators 


dy, 0O<a <n. 


ce [ FO) 


Re |x— yl" 


These operators are classical and for | < p < 7 and q defined by ; - 


1 
qd n 
satisfy I, : L? —> L4, When p = 1 they satisfy the endpoint estimate J, : L'! > 
Lt©. (Cf. Stein [24].) One weight norm inequalities for these operators were first 
considered by Muckenhoupt and Wheeden [19], who introduced the governing class 
of weights, Ap4. For 1 < p < 4% and q such that ; - ; = *,a weight (i.e., a non- 


negative, locally integrable function) w is in A, , if 


> 
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L J 
[wla,, = sup (f wt ax) : (f wl ax) " <0, 
a Vo Q 


where the supremum is taken over all cubes Q in R". When p = 1, gq = =, we say 
w € A,, if 


1 
[w]4,,, = Supess sup (f wi ar) w(x)~! < 00. 
ao xa Vo 


Muckenhoupt and Wheeden showed that when p > 1, Jy : L?(w?) > L1(w) if and 
only if w € Ap, and when p = 1, Jy : L'(w) > L1°(w?) when w € Aj4. Their 
proof used a good-A inequality relating /,, and the fractional maximal operator, 


Maf (x) = sup olf ODI dy « xo). 
Q Q 


Weighted norm inequalities for M, were proved by generalizing the earlier results 
for the Hardy-Littlewood maximal operator. A different proof of the strong type 
inequality was given in [7]: there they used Rubio de Francia extrapolation to prove 
a norm inequality relating 7, and My. 

Given b € BMO we define the commutator of a fractional integral by 


[b, Lalf (x) = bOx)laf() — Ialbf)(a) = / (b@) — bg) SO 


: eye 
These commutators were introduced by Chanillo [2], who proved that with p and g 
defined as above, [b, Ij] : L? > L‘4. He also proved that when p is an even integer, 
b € BMOis anecessary condition. (The necessity for the full range of p was recently 
shown by Chaffee [1].) Commutators are more singular than the fractional integral 
operator: this can be seen by the fact that when p = 1, the do not map L! into 
L?. For a counter-example and substitute endpoint estimate, see [6]. In this paper 
it was also shown that the strong type inequality is governed by A, weights: if 
1 <p < { andw é€ Aj), then [b, ,] : L?(w’) > L*(w‘). This proof relied on a 
sharp maximal function estimate relating the commutator, J,, and M,. A different 
proof using extrapolation to relate the commutator to an Orlicz fractional maximal 
operator was given in [7]. Yet another proof, one that gave the sharp constant in 
terms of the [w],,, characteristic, was given in [9]. This proof used a Cauchy integral 
formula argument due to Chung et al. [4]. 

In this paper we give new and elementary proofs of the one weight inequalities 
for fractional integral operators and commutators. More precisely, we prove the 
following three theorems. 
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Theorem 1.1 Given 0 <a <n, letq= Sy then for all f € L'(w), 


sup tw! ({x ER": lef (x)| > )7 < Cin, a)[wla., i Lf) )w(x) de. 


1 a 


Theorem 1.2 GivenO0 <a<nl<p< a let q be such that ; -7=% If 
w € Ap, then for all f € L?(w*), 


({[,iertnitwoytar)" = Cin.p.adtl, dea [ reorway wx)! 
Theorem 1.3 Given 0 < a <n, 1 < p < 4%, let q be such that : - = = °. If 


w € Apq then for all f € L?(w”) and b € BMO, 


( [tetarenirweay ax)’ 
R" 


max(p’ Qtit+5 ee 


<C(np.a)lwhn llamo ( i. Ya)Pcoy ax) 


1 


We prove Theorems 1.1, 1.2, and 1.3 using the machinery of dyadic grids 
and sparse operators. Dyadic fractional integral operators date back to the work 
of Sawyer and Wheeden [23]. More recently, using the machinery developed as 
part of the proof of the A, conjecture for singular integral operators (see [15, 18] 
and the references they contain) dyadic fractional integral operators were further 
developed and applied to commutators in [9-11]. The advantage of this approach 
is its simplicity: it avoids extrapolation, good-A inequalities and comparisons to the 
fractional maximal operator. One weakness of our proofs is that they do not give 
sharp dependence on the A,,, characteristic of the weights in Theorems 1.2 and 1.3: 
this is to be expected since we freely use their properties to simplify the proofs, 
whereas any sharp constant proof must be arranged to use their properties as few 
times as possible. We record the precise constant we obtain to highlight where we 
use their properties. Sharp constants for the fractional integral operator are given 
in [17], and for commutators in [9]. 

The remainder of this paper is organized as follows. In section “Preliminary 
Results” we give some preliminary results about dyadic grids, sparse operators, 
weighted fractional maximal operators, and weights. In section “Proof of Theo- 
rem 1.1” we prove Theorem 1.1. Our proof adapts an argument that seems to have 
been part of the folklore of harmonic analysis. We want to thank the anonymous 
referee for suggesting this approach; it is much simpler than our original proof, 
which adapted Sawyer’s proof of two weight weak (p, q) inequalities for fractional 
integrals [22]. In section “Proof of Theorem 1.2” we prove Theorem 1.2. Our proof 
uses ideas of Pérez [20] from his proof of two weight inequalities for fractional 
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integrals, and from the elementary proof of one weight inequalities for the Hardy- 
Littlewood maximal operator due to Christ and Fefferman [3]. In section “Proof of 
Theorem 1.3” we prove Theorem 1.3. Our proof uses some ideas from the proof of 
two weight results in [9] to reduce the problem to an estimate essentially the same 
as the one for the fractional integral in the previous section. Finally, in section “Tres 
Recuerdos de Cora Sadosky” we give some personal recollections about the late 
Cora Sadosky. 

Throughout this paper notation is standard or will be defined as needed. The 
constant n will always denote the dimension. We will denote constants by C, c, 
etc. and their value may change at each appearance. Unless otherwise specified, we 
will assume that constants can depend on p, a and n but we will keep track of the 
dependence on the A,,, characteristic explicitly. 


Preliminary Results 


Dyadic Grids and Operators 


We begin by defining dyadic grids and the dyadic fractional integral operators. 
Unless otherwise noted, the results given here are taken from [5, 9-11]. 


Definition 2.1 A collection of cubes D in R” is a dyadic grid if provided that: 


(1) If Q € D, then £(Q) = 2* for some k € Z. 
(2) If P, QE D, then PN QE {P,Q, 9}. 
(3) For every k € Z, the cubes Dy = {Q € D: £(Q) = 2" form a partition of R”. 


Definition 2.2 Given a dyadic grid D, a set S C D is sparse if for every Q € S, 


U | <5la1 
2 
Pes 
PCO 
Equivalently, if we define 
EQ) =0\ JP. 
Pes 
PCO 


then the sets E(Q) are pairwise disjoint and |E(Q)| > 5|Ql. 


The classic example of a dyadic grid and sparse families are the standard dyadic 
grid on R” and the Calder6n-Zygmund cubes associated with an L! function. See [8, 
Appendix A]. 
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We now define a dyadic version of the fractional integral operator and show that 


it can be used to bound J, pointwise. For f € ie and a cube Q, let 


1 
e= f fo) dv = [ fQa)dr. 


Given 0 < a < nand a dyadic grid D, define 


IPf@) = -1Ol* fo: xo). 


QED 


Similarly, given a sparse subset S C D, define 


F(x) = Y- IOI" Fo: xo). 


QES 


Lemma 2.3 There exists a collection {D;}_, of dyadic grids such that for 0 <a < 
n and every non-negative function f, 


Iuf (x) < Csup Ip 'f(). 


Moreover, given any dyadic grid D and a non-negative function f € L°°, there exists 
a sparse set S C D such that 


Pigs rf). 


Given b € BMO and 0 < a < a, for any dyadic grid D, define the dyadic 
commutator 


Pix) = Dott Ib(@) — bO)IFO) dy - xo). 


OED 


Lemma 2.4 There exists a collection {D;}\_, of dyadic grids such that for 0 < a < 
n, every non-negative function f, and every b € BMO, 


[[b. Lalf | < Csup CP f(x). 


Remark 2.5 It follows at once from Lemmas 2.3 and 2.4 that to prove norm inequal- 
ities for J, and [b, Jy] it will suffice to prove them for their dyadic counterparts. 
Moreover, since these dyadic integral operators are positive, we may assume that f 
is non-negative in our proofs. Finally, by Fatou’s lemma it will suffice to prove our 
results for functions f € L°°. In particular, this will let us pass to sparse operators. 
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Weighted Fractional Maximal Operators 


We begin with some basic facts about Orlicz spaces. Some of these will also be 
needed in section “Proof of Theorem 1.3” below. For further information on these 
spaces, see [21]; for their use in weighted norm inequalities, see [8]. Given a weight 
o, letdo = o dx. We define averages with respect to the measure do: 


1 
hue f fos)do = = i: fQa)do. 


Given a Young function ® and a cube Q, define the normalized Luxemburg norm 
with respect to ® and do by 


Iflao0 = int} > 0 f o( 4") ee i 


If we let O(t) = #?, | < p < ~w, then 


ieecs (f ira)? ao) = [fllnoo- 


Associated to any Young function is its associate function ®. We have the 
generalized Holder’s inequality: for any cube Q, 


f F@)g(a)| do < Clifllogcllellaoe: 


the constant depends only on ®. 7 
Hereafter, we will let ®(t) = tlog(e +42); then it can be shown that ®(t) = e’—1. 
It follows for this choice of ® that for 1 < p < ~, 


IIfllioo < IIflloac S CP) Ilp.00- 


We now define a weighted dyadic fractional maximal operator. Given a dyadic 
grid D and a weight o, for 0 < a < n define 


Me af (x) = sup o(Q)"|Iflloo0- 
QED 


If D(t) = t we write M?.,. If a = 0, then we simply write M3. or MP if ®(t) = ¢. 


Lemma 2.6 Let (ft) = tlog(e +t). Given 1 < p < co and0 < a < n, define q by 


: = ; = *. Given a weight o and a dyadic grid D, Me ow : IP(a) > L4(a). The 


same inequality holds for MP... 
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Proof This result is well-known when ®(t) = f; the proof is essentially the same 
for B(t) = tlog(e + ¢) and we sketch the details. By off-diagonal Marcinkiewicz 
interpolation (see [25]) it will suffice to prove the corresponding weak (p,q) 
inequality: 


o({xe R": MP, f(x) > Ha < . (/ F(a)? ac)’ 
, t Rn 


Fix t > 0; then we can decompose the level set as the union of disjoint cubes 
Qe€Q, CD that satisfy 


o(Q)" |Ifllo.o0 >t. 


Therefore, since the cubes in Q, are disjoint and g/p > 1, we have that 


o({x eR": Me f(@) >) = D> o(Q) 


QEQ, 


<6?) o@)** I llg05 


QEQ; 


q 
C a Pp 
—— acy (f yr ao) 
td fa) 
QEQ; 
qd 
P 
([wrae) 
geQ, “2 


y [ras 


QEQ, 


(/. yerae) 


IA 


C 
ra 


IA 
ae) 


IA 
a 


Properties of Ay,q Weights 


In this section we gather a few basic facts about the A,,, weights and the closely 
related Muckenhoupt A, weights; for further information see [12]. For 1 < p < oo, 
w € A, if 
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= 
[wla, = spf w(x) dx (f w(x)? ac) < 00. 


For p = 1,w eA, if 


[w]4, = sup ess sup (f w(x) ax) w(x)! <0. 
Q 


Q  xEQ 


It follows at once from the definition that for all p and q such that ; = ; = ©, then 


w € A, if and only if w? € A,,r = 1+ a and [w4]4, = Wine (When p = 1, we 
interpret 7 as 0.) By the duality of A, weights, w” € Ay and [w”] Ay = [wii 

As a consequence, we have that both w? and w’ are in Aco: Below we will need 
to use two properties of Ago weights; for both we give the sharp constant version. 
There are multiple definitions of the Ago characteristic (cf. [13]) but for our purposes 
we will simply use the fact that [w]4,, < C(”)[w]a,. 


Lemma 2.7 Given a weight 0 € Ap C Ago, then for any cube Q and set E C Q: 


P 
(1) (G) < fo, 2, 


|Q| o(Q) 
me oO =< a) "where s! = c(n) [Olan 


Proof The first inequality follows from the definition of A,: see [12]. The second 
follows from the sharp form of the reverse Holder inequality due to Hyt6nen and 
Pérez [16]: for every cube Q, 


(f a(x) ax) a8 of o (x) dx, 


where s = 1 + . The desired inequality follows if we apply Hdlder’s 


a 
co ]A00 


inequality with exponent s to i 0 (x) 7XE(x) dx. Oo 


Proof of Theorem 1.1 


By Remark 2.5 it will suffice to prove the weak (1,q) inequality for the dyadic 
operator J”, where D is an arbitrary dyadic grid, and for f a non-negative function 
in L&. 
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Since g > 1, || - ||zz00(,2) is a norm, so by Minkowski’s inequality, 


TF ll 2:00 wa) = 


yolk! i f0)x00) dy x00) 


QED 
<[ £0) 


To complete the proof we need to show that for almost every y, 


L1-% (1) 


dy. 


L4% (w1) 


y- 10 xo) x00) 


OED 


< CM(w)(y)7 < Cla, WO). 


L4- (w7) 


y- 10 xo) x00) 


QED 


The second inequality is just the fact that w € Aj,,, So it remains to prove the first. 
Let K = —1 Fix y € R” and t > O, and let Q, be the largest cube in D 


a 
1-2 4 


1 
containing y such that K|Q;|_ « > t. Now fix x 4 y such that 


Y- 10M * xo) xo) > 1, 


QED 
and let Q, be the smallest cube in D containing x and y. Then 
il oa 1 1 
YO *xeO)x0@) = D7 12*O.174 = KIO, 
QED k=0 
and so by maximality, x € Q,;. Since this is true for all t > 0, we have that 


= sup tw({x € R": K|Q,|74 > 2) 


L4:& (w?) 1>0 


Y= 12F* xe) x00) 


QED 


< Ksup|Q,|-7w"(Q,)4 < KM(w")(y)¢. 
t>0 


Proof of Theorem 1.2 


By Remark 2.5, it will suffice to prove the strong (p, q) inequality for f non-negative 
and in L°°. We may also replace I, by the sparse operator ee where S is any sparse 
subset of a dyadic grid D. 
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Let v = w4 ando = w~” and estimate as follows: there exists ge LY (w-%), 
gw" ||” = 1, such that 


fd =] 1, dx = m d 
IUSPrla = ff taf oes) de = Yo Jol F of g(x) dx 


OES 


= D510)" 'o(Q)v(Q)""* (fo) 0.g0(Q)* (gv-") ow. 


QES 


Since 1-* = ra + Fi by the definition of the A,,, condition and Lemma 2.7 (applied 
to both v and o) we have that 


|O\*'o(Q)v(Q)'“* < [wla,,0(Q)?(Q)* <[why,” ? o(E()? (EQ). 
() 


If we combine these two estimates, then by Hélder’s inequality and Lemma 2.6 we 
get that 


USA wllq 

= Ch” fo) 900 EO)? ¥(Q)* (gv!) 0..(E(Q)” 

QES 
<c?*? (X04 11610) ( YO) ¥ (goo? ‘(EQ)’ . 

QES OES 
+948 D -1 Pp =] p 4 

< Cw], Vp. d |, alte (fo !)(x)? do ‘ey ot (gu (x? dv 

QeS OES 


ecg?" (f eed ([, sewers) 


<Chuly,” "(foray in) (sore av) 


1 P 
—1 
= Clw Bes 7 lfwllpllgw lla’ 


q 
/ 


/ 
1 P 
= Clw ie ’ Ilfwllp. 
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Proof of Theorem 1.3 


For our proof we need two lemmas. The first is a weighted estimate for functions in 
BMO; our proof adapts ideas from Ho [14]. 


Lemma 5.1 Let ®(t) = tlog(e+1). Given a weight o € Ago, then for any b € BMO 
and any cube Q, 


lb (Pola. S Clos llbllamo. 


Proof By the John-Nirenberg inequality, there exist constants C,, C2 such that for 
every cube Q and A, t > 0, 


Cor 
x € Q: |b) — (bol > Ail < Cildlexp ( a } 


||> || amo 


Since 0 € Ago, by Lemma 2.7 we have that 


o(tx € Q: [b(x) — (bol > A1}) < 2CF"o(Q) exp ( as) . 


| llamos’ 


where s’ = c(n)[o]a,,- Let 


gly 
- ad + 2kC} \lollemos’ 
Co 


r 


= Clo]a.||Pllamo. 


where ®(t) < ke’. Then we have that 


f g (POs el) do <kotoy i. elo ({x € Q: |b(x) — (b)o| > Ath) dt 
0 Xr 0 


J SS CoAt 
< 2kC* : ——————_ } dt 
= if ¢exp( iar) 


1 oo + 
< 2kC"” i era 
0 
=i 


Therefore, by the definition of the Luxemburg norm, we get the desired inequality. 
oO 


The second lemma is a weighted variant of an estimate from [7]; when ®(t) = ¢ 
the unweighted estimate is originally due to Sawyer and Wheeden [23]. 
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Lemma 5.2 Fix 0 < a <n, a dyadic grid D, a weight o and a Young function ®. 
Then for any P € D and any function f, 


Y= Ql" o(O)Ilifllo.o0 < C(@)I|PI"o(P)IIIfllorc- 


QED 
OCP 


Proof To prove this we need to replace the Luxemburg norm with the equivalent 
Amemiya norm [21, Section 3.3]: 


[flora < int tlaf 1+ (“ "\ a o| <2liflloro- 


By the second inequality, we can fix Ay) > 0 such that the middle quantity is less 
than 3||f||o,p,,. Then by the first inequality, 


Yl" lAlliflleoc =>) dL ll" IAlliflle.o. 


QED k=0 QcP 
OCP £(Q)=2-*e(P) 
[o.@) 
a Xx 
<|P)* > 2" io fire te) as 
k=0 dX 
€(Q)=2-*L(P) 


= CIPI" Ao five(Z) ao 


< C|P|"o(P)Ilflla.r0- 


oO 


Proof of Theorem 1.3 Again by Remark 2.5, it will suffice to prove that given any 
dyadic grid D, 


ICP Flas) S Clif llr), 


where f is non-negative and in L°°. By duality there exists a non-negative function 
/ 
g ELT (w), |lgllia qa) = 1, such that 


CPF lcs) = [ CPF (x)g(x) dx 


= ya! [ f \b(x) — BOY/FO)E(«) dy dx 


OED 
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< Ylal! *f 100) — 0) (b) olf) dy i g(a) dx 


QED 
+ |O|"F |b) - |g(x) dx | f(y) dy 
Yl f ee —Woleeraxf 


=[,+h. 


We will first estimate /;. Let v = w? and o = w~”’, and let ®(t) = tlog(e + 2). 
Then by Lemma 5.1, since [o]4., < Wlana? 


h= Ditoof 0) — Walfodow)- ‘dof g(x) dx 


QED 


lA 


CY |Olfo(O)Iifo“oo0l|o- Molanf, sod 


QED 


< Cl lbllawo Yo |Ol*o(Q)lifo~ 'loonf g(x) de. 


QED 


We now want to show that we can replace the summation over cubes in D by a 
summation over a sparse subset S of D. We do this using an argument from [7]; see 
also [9]. Fix a = 2”*! and define the sets 


= {x € R": M? g(x) > a‘}. 


Then arguing exactly as in the construction of the Calderén-Zygmund cubes (see [8, 
Appendix A]), each set 2; is the union of a collection S; of maximal, disjoint cubes 
in D that have the property that a* < (g)g < 2"a*. Moreover, the set S = (J, Sx is 
sparse. 

Now let 


Ce ={QED:a* < (g)o <ak*'}. 


Then by the maximality of the cubes in S;, every cube P € Cy is contained in 
a unique cube in S;. Therefore, we can continue the above estimate and apply 
Lemma 5.2 to get 


hh < Chl lbllewo Ol IVo"'leoof, g(a) dx 


k QEC, 


< Chel llamo Dat Y> SS 1A" oO)llfom'lla.0.0 


PES; OCP 
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= Chu, Ibllavo DY [PIEOP IO lomo 80) ds 


k PES, P 


= Clw}h, ,llPllamo Y_ |P|"!o(P)v(P)'~" fo |e 0.00(P)" F gue! dv. 
PeS Q 


We can now argue exactly as in the proof of Theorem 1.2, beginning with the 
estimate (1) and applying Lemma 2.6 to complete the estimate of 7; with a constant 
(siege 
[wh, 4 P P . 
The estimate for I, is essentially the same, exchanging the roles of f and g ando 


y 
ao : ; gtl1+44+e2 
and v. This yields the above estimate except that the constant is now [w] os ao 


This completes the proof. | 


Tres Recuerdos de Cora Sadosky 


I first met Cora at an AMS sectional meeting in Burlington, Vermont, in 1995. It 
began inauspiciously: at the reception on the first night a determined looking woman 
came up to me, waved her finger under my nose and said, “I have a bone to pick with 
you. We will talk later,’ and then marched off. She found me again about 30 min later 
and proceeded to explain. The year before she had been asked by an NSF reviewer 
for her opinion of my proposal which mentioned the two weight problem for the 
Hilbert transform. She had told the reviewer to refer me to a paper by her and Mischa 
Cotlar where they gave the first (and for a long time the only) characterization 
of these pairs of weights. He did not, however, share this reference, and when I 
published the paper based on this work I did not cite it. Cora assumed that I had 
simply disregarded this advice and was understandably annoyed. However, once I 
explained that I had never received this information she immediately became much 
friendlier and invited me to visit her in Washington, D.C. 

For the next few years she took an interest in my career. Her first major 
intervention on my behalf came in 1996, when she applied her forceful personality, 
first to convince me that I must attend the E/ Escorial conference in 1996 (despite 
moving, changing jobs, and having two small children and a pregnant wife), and 
then to strong-arm funding from a colleague to pay for my trip. It was at this meeting 
that I met, among others, Carlos Pérez, and began a collaboration that has continued 
to the present day. 

Two years later, in 1998, we met again at an AMS sectional meeting in 
Albuquerque. At this meeting she picked up on a point that my Spanish colleagues 
were also making: given my name and my ancestry, I really ought to be able to 
speak Spanish. Her solution was that I should “read a good math book in Spanish.” 
She strongly recommended that I read Javier Duoandikoetxea’s book, Andlisis de 
Fourier, telling me that I would see some good mathematics as well as “learn 


Elementary Proofs of One Weight Inequalities 197 


Spanish.” For the next year I worked through the text line by line, in the process 
writing a complete translation. I approached Javi with an offer to complete the 
translation and update the notes, and together we produced an English edition. In the 
process I did in fact learn a great deal of harmonic analysis, but unfortunately, Cora’s 
original goal was not achieved: my spoken Spanish did not improve appreciably. 
Moreover, this translation had the unintended consequence of convincing large 
numbers of mathematicians from Spain, Argentina and elsewhere that I did in fact 
speak Spanish. 

Cora and I never collaborated on a paper. She suggested several projects, but my 
interests were moving away from hers and nothing came to fruition. At the time I 
never really quite understood or appreciated the support she provided at these points 
in my career, and it is only in looking back that I realize how much I owe her. So 
belatedly I say, muchisimas gracias, Cora. 
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Finding Cycles in Nonlinear Autonomous 
Discrete Dynamical Systems 


Dmitriy Dmitrishin, Anna Khamitova, Alexander M. Stokolos, and 
Mihai Tohaneanu 


Dedicated to Alexey Solyanik on his 55th birthday 


Abstract The goal of this paper is to provide an exposition of recent results of 
the authors concerning cycle localization and stabilization in nonlinear dynamical 
systems. Both the general theory and numerical applications to well-known dynam- 
ical systems are presented. This paper is a continuation of Dmitrishin et al. (Fejér 
polynomials and chaos. Springer proceedings in mathematics and statistics, vol 108, 
pp. 49-75, 2014). 


Introduction 


The problem of cycle detection is fundamental in mathematics. In this paper we 
will be mainly concerned with the problem of detecting cycles of large length in 
an autonomous discrete system x,41 = f(x,). The standard approach is to consider 
the composition map fr(x) := f(...f(x)...) and then solve the equation fr(x) = x. 
However, this approach does not work well even in some basic cases. For example, 
in the model case of the logistic map f(x) = 4x(1 — x) it leads to a polynomial 
equation of degree 27, and thus a relatively small cycle length T can give rise to 
very serious computational difficulties. 

The goal of this article is to suggest an alternative approach to the problem of 
cycle localization. In full generality the problem is very difficult and it is hard to 
believe that a universal technique could be developed. We thus start with a model 
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case of non-linear autonomous discrete dynamical systems. The simplicity of the 
setting allows us to make some progress and to develop a feasible plan for further 
developments of the method. Since one fundamental tool of dynamics, often used 
for analyzing continuous time systems, is the reduction of continuous time flow to 
its Poincaré section, which is a discrete system, understanding the case of discrete 
systems is of great help in studying continuous systems also. 

The core of the suggested method is the stabilization of the solutions by delayed 
feedback control (DFC) of a special type. We will briefly discuss a way to linearly 
stabilize the system in section“Linear Control”; however, it turns out that the linear 
DFC method has some obvious limitations regardless of the number of prehistory 
terms involved. In contrast, in the subsequent sections we will show that a certain 
nonlinear DFC schedule allows one to robustly stabilize chaotic solutions for any 
admissible range of parameters. 

The methods we developed can be considered as chaos stabilization, and we 
believe they are of interest in other disciplines. Chaos theory is a part of modern 
Physics and the majority of the publications on chaos are in Physics literature; 
for instance, problems of stability have been discussed in [1—5, 12, 14-18, 21-27]. 
There are many specialists in chaos theory who are physicists, among whom we 
mention P. Cvetanové, C. Grebogi, E. Ott, K. Pyragas, J.A. Yorke etc. On the other 
hand, many biological systems exhibit chaotic behavior as well. A fundamental 
monograph of I.D. Murray [20] contains deep and advanced discussions and 
applications of the non-linear dynamical systems to models of population growth. 


Settings 


Consider the discrete dynamical system 
Xn+1 =f (Xn), f:A-A, ACR". (1) 
where A is a convex set that is invariant under f. Let us assume that the system has 


an unstable T-cycle (x},...,x7). We define the cycle multipliers j;,..., (4, as the 
zeros of the characteristic polynomial 


T 
det | yl — | | 27@7) = 0, (2) 


j=l 


We will assume that the multipliers are located in a region M C C. 
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Stability Analysis 


A standard approach (cf. [19]) to investigate stability with no delays is to construct 
a new map that has points of the cycle as equilibriums and then linearize about the 
equilibriums. Let us consider a system with time delay in a general form 


Xn+1 = F (Xn, Xn-1; tee Xn—t)s F:R" SR", te Ly. (3) 


We will study the local stability of a cycle {n1,...,r} where nj € R”. In other 
words for all n > t + 1 the following equations are valid 


N(n+1) mod T = F(nn mod T> N(n-1) mod T>+++>5 (n—1) mod T). 


where, slightly abusing notation, we assume that T mod T = T. 
We can now consider an auxiliary system with respect to the vector 


Zn Xn—t 
2 
zi ) Xn—t+1 
Zn = . — 
(+1) x, 
of size m(t + 1): 
1 2 
ae zs , 
2 3 
He zy : 
Zn+1 = = . 
+1 +1 1 
a FO acto) 


We can now rewrite (3) in the form 


Znt1 = Fan) 


with F : R™Ct+)D _, Rat), 
Let W(z) := F(...F(z)...) be F composed with itself T-times. We can now 
analyze the system 


Ya+1 = Wyn). (4) 


Let us periodically repeat the elements of the cycle: 
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{1,25 -+6-s NTs M1. N2,+++5NT,-..}. The first 7 + 1 elements of this sequence 
form a vector 


nN 
y=] 
In the same way we define the vectors 
2 NT 
Cd eae pe 
It is clear that the vectors y7,..., 9; are equilibria of the system (4). 


The cycle {,...,r} of the system (3) is asymptotically locally stable if and 
only if all equilibriums y;, . . . , y7 of the system (4) are asymptotically locally stable. 

For the equilibrium point yy of the system (4) the Jacobi matrix is defined by the 
formula 


T 
DW (yt) = | [PFo7) (5) 


j=l 


where the matrix DF (v7) has size m(t + 1) x m(t + 1) and equals 


O TI O... O 
O OT... O 

DF(;) = dm : (6) 
O00... I 


A 99 9® (i) 
Qy Q) Q3 ... Ory) 
Here the matrices O and J are the m x m zero and identity matrices. Further, 


; a 
Q” = if , r=l,...,tt+hj=1,...,7, 
dz |, 
J 


i.e. the value of the derivative evaluated at the point YF. 
For all other equilibria y* the Jacobi matrices D¥(y;") can be computed in the 
same manner, and 


DW(y;) = DF r+) moar) ***PFO;), 
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which can be obtained from (5) by a cyclic permutation of the factors. The eigen- 
values of DW (y;) thus coincide for all 7 = 1,..., 7 (see for example [13] for more 
details.) 

If all eigenvalues of the matrix DW (y;), which are the roots of the polynomial 


det (AJ — DW(y*)) 


are less than one in absolute values then the cycle of the system (3) is locally 
asymptotically stable. 

Note that in the scalar case m = 1 the matrices DF (y;) are in Frobenius form, 
and the matrix (6) is a generalized form of the companion matrix. If the system has 
the special form (10) below, then the characteristic equation can be found explicitly 
by means of induction, see [6]. 


Linear Control 


There is a common belief that a generalized linear control 


N-1 
u=— > gj (Xn—j = Xipi) (7) 
j=l 


can stabilize the equilibrium for the whole range of the admissible multipliers of the 


system (1). In this case F(X%n, Xn—1,---.Xn—-~w—nrT) = f(%n) + u and the characteristic 
m 


equation for the system closed by the control (7) is ¥,(A) = I] Xuj(A), where 
j=l 


Ana) = AN — wAN7! + p(A) and 
pA) = aAN 1 4 aA? 4... + ay. (8) 


Here ju; are cycle multipliers, i.e. the roots of the characteristic equation (2) of the 
open loop system, while the coefficients a; and the gain ¢; are related by the bijection 


N 


=) Ge fH lyse N= 1, (9) 
k=j+1 


The proof can be done by the methods considered in the section “Stability Analysis” 
above. It is not trivial but similar to a non-linear scalar case [6]. 

In the case of real multipliers jz a careful application of Vieta’s theorem implies 
that a necessary condition for the polynomials y,,(A) to be Schur stable is 1 — 2" < 
je < 1. It turns out [25] that for any fixed ju in this range there are coefficients that 
guarantee the stability of the polynomial ,,(A) for this given ju. 
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At this point, a natural question to ask is how robust the selected control can be, 
i.e. assuming that we are given the Schur-stable polynomials y,,(A), how much can 
we perturb the multipliers jz so that y,,(A) remain Schur-stable? More rigorously, 
the inquiry is the following: 

What is the maximum length of a connected component of M? 

In [10] we discovered a remarkable fact - the answer to the above question is 4 
regardless of how large N is. Below is an idea of the proof. 


Solyanik Visualisation 


The polynomial 7,,(A) is quite complicated and difficult to study directly. Alexey 
Solyanik (Personal communication) suggested a remarkable way to visualize the 
situation. Namely, y,,(A) is a stable polynomial if and only if x,(A) = AN — 
pAN + p(d) 4 0 for |A| = 1 or 


1 7 1 Zz 
#—+~_=-# =: (2), |z| <1. 
wo y4PQ "pw" 1+) 


where z = + and q(z) = ae Therefore y,, is Schur stable if and only if 1/u ¢ 
@(D), where D = {z : |z| < 1}. This can be rewritten as pp € (C\®(D)*, where 
z* := 1/Z is an inversion. The above formula reduces the problem of stability to 
the problem of verifying whether jz is in the above set, which is still difficult, but 


more manageable. 


Kobe Quarter Theorem Application 


Now, let us expand @(z) in a power series ®(z) = z+ anz* +4327 4+...inD. If & 
is univalent, then by the K6be Quarter Theorem +D Cc @(D) and therefore 


| 1 
pL 
We were able to get in [10] a generalization of the K6be Quarter Theorem which 
allowed us to obtain the result mentioned above. We also remark that the inequality 
above explains the value 4 mentioned in the previous section. 
Finally, it was proved in [10] that if the diameter of the set of multipliers is larger 
than 16, or the diameter of any of its connected component is larger than 4, then 


for any N there is no control (7) that stabilizes equilibria of the system (1) for all 
admissible parameters of the system. 


1 
>-> <4, 
; || 
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Fig. 1 Logistic close-loop 
system 


The other obvious problem with the linear control is that the close-loop system 
can have solutions that are outside the domain of the map, i.e. the convex invariant 
set for the open-loop system is not necessarily an invariant set for the closed-loop 
system. In Fig. 1 above the solution to the logistic system closed by the control 
u = —0.01(%, — Xn+1) with x» = 0.7501 is displayed. Note that x» = 0.75 is an 
equilibrium for a logistic map while a little perturbation produces a solution that 
blows up after 30 iterations. 

These two basic obstacles - the range for of the close-loop system and the limited 
rage for the connected component for the multiplier - justify the introduction of the 
non-linear controls. 


Average Non-linear Control 


Typically, an arbitrarily chosen initial value x9 produces a chaotic solution, i.e. we 
observe strong oscillations. An efficient way to kill oscillations is averaging, as can 
be readily seen in the summability of many trigonometric series. So, we decided to 
consider a new system 


N 


N 
Sat = > afGriatr), > ae=1. (10) 
k=1 


k=1 


It is useful to rewrite the system as X,41 = f (Xn) + Un, where 


N-1 


Un = — > & (fF (%n-jr+7) —f (%n-ir)) 5 oe 


j=l 
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where a, and &,; are in one-to-one correspondence as in (9). 

We can now see that (10) in fact is the system (1) closed by the control (11). In 
this case the convex invariant set for the open-loop system is also invariant for the 
closed-loop system. We also remark that u,, = 0 for cycle points of period T, which 
implies that the closed-loop system x,41 = f(%) + Un preserves the T-cycles of the 
initial one, which is very important for us. 


Stability Analysis 


The characteristic equation for the system (10) can be written in a remarkably useful 
form 


m N R 
I] ATADHT — yy, (dean) = 0, HEM, j= l,...,m 
k=1 


j=l 


Here ju; are the multipliers of the open loop system (1). The proof for the scalar case 
m = | can be found in [6], and the vector case can be done in a similar way. Denote 


x T 
f(A) = ATA-DFT Ly (>: aa) ; 
k=1 


N 


q(z) = dae and z= . 
Then $(A) = 0 is equivalent to u~! = z(q(z))’ . So, if 
Fr(z) = z(q@)" (12) 
then the inclusion 
M c (C\Fr(D)* (13) 


guarantees the local asymptotic stability of the cycle with multipliers in the set M. 
The inclusion (13) is the Solyanik visualization in this setting. 

We are thus left with the following problem in geometric complex function 
theory: given a set M containing all the multipliers, find a properly normalized 
polynomial map z — F(z) such that M Cc (C\Fr(D)) . 

Since in the simplest case x,+1; = 4x, the solution x, = Cy” is exponentially 
blowing up for 4 > 1 there is no way to stabilize the case when one of the 
multipliers satisfies 4 > 1 by only using the small gain (11). Therefore we will 
consider only negative real multipliers, and in the complex case we will assume 
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that M is disjoint from [1, 00). We also remark that the cycle is already stable if 
the multipliers are in (—1, 1) or more generally in the unit disc D of the complex 
plane C. 

It is natural to expect that the stability will be getting worse if the set M is close 
to the half axis [1, 00). Below we will see several examples supporting this thesis. 

Since ((0, 00)\(0, 1])* = (0,1), and (0,1) is the largest admissible range for 
real positive multipliers, the best case scenario for us is a polynomial whose 
image of the unit disc looks like a narrow neighborhood of (0, 1]. Solyanik 
(Personal communication) suggested as an example of such an object a famous 
Alexander polynomial - the polynomial with the coefficients a, = 1/k, properly 
normalized [1]. The corresponding set (C\F7(D)* has a cardioid type shape and for 
a large N can cover any given point except for the real numbers z > 1. We refer the 
reader to Fig. 2, where the image of the unit disc under the Alexander polynomial 
map is the interior of the inner (orange) curve, and the set (C\Fr(D)* is the interior 
of the outer (green) curve. 

We thus know that theoretically there is a way to stabilize equilibria with 
any multipliers outside [1,0o) by the control (11). However, the length of the 
prehistory is growing exponentially with the magnitude of the multiplier. In Fig. 2 
the largest multiplier which can be covered has a magnitude of 15, while the length 
of the prehistory is 20,000. We are thus led to the problem of finding the optimal 
coefficients a; so that for a given set of multipliers M the prehistory N is as small as 
possible. 

It turns out that in the case of multipliers with negative real part the number NV 
can be shown to be much smaller, of only polynomial growth with respect to the size 
of the multipliers, which is of practical use. This will be explored in the subsequent 
sections. 


Fig. 2 Image of F(D) with 
the Alexander polynomial 
F(z), N = 20,000 
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Optimization Problem 


Let us look first at the case when the multipliers lie on the half-axis (—oo, 1). In 
this case the problem of stabilization can be reduced to the following optimization 
problem: find 
re = sup min {R (Fr(e")) : 3 (Fr(e")) = 0}. 
te al te [0,7] 

Technically speaking, this leads to a disconnected set of multipliers, as the 
boundary of (C\F7(D)* will be tangent to the real axis (see, for example, Fig. 4 
below). However, there is an easy trick to get rid of the tangent points: given € > 0 
the polynomial F%(z) = (1 + €)7!(Fr(z) + €z) satisfies 


min, {9t (Fp(e")) 23 (Fe(e")) = 0} > N? -e. 


te [0,77] 


and does not intersect the real axis except fort = 0 and t = wg. Since 
lim,_+9 F7(z) = F(z), one can use the coefficients of the polynomials F(z) instead 
of F(z) in computer simulations. In particular, it is done below. 

It can be shown that for the closed-loop system a robust stabilization (i.e. by the 
same control for all w € (—j*, 1)) of any T-cycle is possible if 


Ge) 1B) 2 1, (14) 


By duality, for any z* > 1 a robust stabilization of T-cycle in the closed-loop 
system is possible if N > N*, where N* is the minimal integer N such that (14) 
holds. Formula (14) provides a practical criterion for the choice of N given jz* and T. 


Real Multipliers, Optimal Polynomials for T = 1 


It was proved in [7] (see also [8]) that given N the largest jz such that 
(-2,1) c (C\F:@)* 


can be achieved if the coefficients of g(z) in (12) are the coefficients of a polynomial 
related to the well-known Fejér polynomial, namely 


=the ie ae, PE | OS 
4 2(N + 1) N+1 N+1 


Moreover, any N such that 


u-tan? —_ < ] 
2(N + 1) 


allows stabilization and this inequality is sharp. 
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Fig. 3. F\(D), N=12 


Fig. 4 M = (C\F,(D)* 


For the choice N = 12 we display the image F,(D) and the maximal multiplier 
set M that allows for stability in Figs. 3 and 4. 
Note that for NV = 12 the boundary for the multiplier is 
pe- tan? (17/26) < 1. 


That implies that 4 < 68 which is easy to see in Fig. 4. 
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Real Multipliers, Optimal Polynomials for T =2 


It was proved in [9] (see also [8]) that given N the largest jz such that 
(-n, 1) c C\F2)* 


can be achieved if the coefficients of q(z) in (12) are coefficients related to the Fejér 
kernel of order 2N 


2 ee 
a= oo F j=l,...,N. (16) 


Moreover, any N such that 


1 
be 2 <1 
allows stabilization of a 2-cycle and this inequality is sharp. 


For the choice N = 12 we display the image F,(D) and the maximal multiplier 
set M that allows for stability in Figs. 5 and 6. 


Fig. 5 F,(D),N = 12 
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Fig. 6 M = (C\F,(D)* 


Note that for NV = 12 the boundary for the multiplier is .- 4+, < 1. That implies 


122 — 
that 4 < 144 which is easy to see in Fig. 6. 


Real Multipliers, Quasi-optimal Polynomials, T > 3 


The case T > 3 is much more difficult compare to the cases T = 1,2. We were 
unable to employ harmonic analysis technique and had to use complex analysis 
methods. Remarkably enough, we were able to construct a family of polynomials 
that are optimal for T = 1,2 and that produce the expected estimate for the 
multiplier range if T > 3. 

Define the set of points 


_ ao +TQj-1)) bo N N-1 
y= T+W-DF J=— lees > (N-even), (S (N-od 


and the generating polynomials 


N-2 
“2 
nn(z) = 2(z+ 1) I] (z—el)(z—e~%), N-even; 
j=l 
N-1 


a 
nn(z) =Z I] (c—el)(c—e-%), N-odd. 
jel 
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Writing ny(z) in a standard form 
N 
j-1 
nv(z) =z) ql 
j=l 


we can define the following three-parameter family of polynomials 


= i¢eG= pF - 
q(z, T,0,T) => (I a) Ge", (17) 


where K is a normalization factor that makes g(1, T, 0, t) = 1. In the particular case 
o =t = 2 Kis given by 


fi 
is 


1 N=2 
—=2°2 (1—cos¢), N even, 
K : 
j=l 
and 
N=1 
1 v3 7 
"7 22 (1 —cost), N odd. 


j=l 


The polynomials (17) are substitutes for g(z) in (12) and play the same role in 
the T > 3 scenario as Fejér polynomials do in the cases T = 1,2. Because of that 
we call them quasi-optimal. 


For any T and N, by choosing o = t = 2 the relation (13) is valid for 


N=2 T 


T To fi 
——______ TT cot? 2] <1, N-even, 
ON Seger Ll’ 5 — 


j=l 
and 


T 


N-1 
2 
t; 
cot? 2] <1, N-odd. 
Le U 5 


Moreover, for large N the left hand side in the above inequalities is approximately 


T 

2 PaO 
ligt (r = ~ wt. T > oo. 
N? 2T N? 


The proof is work in preparation by the authors [11]. 
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We conjecture that the coefficients we have found are actually optimal. 


Conjecture A Assume that N and T are given. Then the largest js such that 


* 


—— 


has a magnitude proportional to N? and is achieved by picking q(z) in (12) to be 
q(z, T, 2,2). 


In favor of this conjecture are numeric simulations and the fact that for T = 1,2 
the new family coincides with the polynomials that are optimal. Moreover, Figs. 7, 9 
and 8, 10 are remarkably similar to Figs. 3, 5 and 4, 6 which correspond to the cases 
T = 1,2. 


Examples 
Example of a Quasi-optimal Polynomial for T = 3,N = 5 


Let us consider a numeric example T = 3,N = 5. In this case t, = 52/14 and 
ty = 112/14. The generating polynomial is 


3 3 
n(z) = z+2(cos sin = )z?+2(1—cos =-+sin —)z-+2(cos sin aettZ. 
The normalized factor is 


1 5x \—! lix\7! 
K = ={ 1—cos — 1 — cos —— = 0.496... 
2 14 14 


To get the region for the locations of jz one needs first to build a covering polynomial 


a: a 3 ? ss 4 
COs — sin 
> TA 7 


F3(z) = z(q(z, 3,2, 2))° = Ke( 


XN 1 : 
— sin Sz? + z) . 


cA IT 4 3 
l= 2 
( cos 7 + sin ae + 7 (cos 4 


14 
See Fig. 7. Then take the inverse and get the region displayed in Fig. 8, where the 
range for the multiplier is 7 € (—33, 1). Using the above polynomials one can get 
the estimates i?) > —0.03. We conjecture that these values are optimal. 
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Fig. 7 F;(D), N=5 


Fig. 8 (C\F3(D)* 


Example of 8-Cycle in Logistic Equation 


As an example of an application of the above method let us consider the logistic 
equation 


Xnt1 = 4%n(1 — xn). (18) 


It is well known that it has cycles of any length and that the cycles are unstable. 
Consider the problem of finding cycles of length 8. Figure 9 displays the 

polynomial images of the unit disc Fg(D) with the quasi-optimal polynomial of 

degree 12. Figure 10 displays the inverse image (C\F3(D)* 
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Fig. 9 Fg(D), N=12 


Fig. 10 (C\Fs(D)* 


To do that let us consider the system (18) and close it by the control 


N-1 


Un = — > ej (f (Xn—8)+8) — f (X%n—8;)). 


j=l 


We provide numeric simulation with the 8-cycle control above applied to the 
standard logistic equation. Figure 11 below reveals the existence of two 8-cycles. 
Moreover, the size of the control u, goes to 0 as n — o, so it provides an 
increasingly better approximation to the initial system. For n > 9000, for example, 
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Fig. 11 Two 8-cycles in logistic equation on (x,—1,X,) plane 
Fig. 12 Dynamics of logistic 
equation with x) = 0.2518 46 
0.8 
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0.5 
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0.3 
0.2 
0.1 
1430 1440 1450-1460 1470 1480 = 1490 


the control |u,| < 0.00002, so we can obtain the value of the points of the two 
8-cycles from Fig. 11 up to the fifth decimal: 

{0.25 18;0.7535;0.7429;0.7640; 0.7213;0.8042;0.6299;0.9325}, and 

{0.3408;0.8987;0.3642;0.9262;0.2733;0.7944;0.6533;0.9059 }. 

By increasing the number of iterations n one can get more digits in the cycle. 

The subtlety of the situation is well illustrated by the fact that knowledge of a 
point on a cycle does not guarantee that the whole cycle can be found numerically 
by the iterative procedure (18) because of chaotic behavior of the solutions to (18). 
Figure 12 demonstrates what happens when we plug in x) = 0.2518 in (18). Since 
the system is chaotic, the numerical simulations do not reveal the existence of the 
8-cycle. On the other hand, after adding an 8-cycle control the solutions exhibit 
8-periodicity, see Fig. 13. 
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Fig. 13. Dynamics of the 
solution in the closed-loop 
system with x) = 0.2518 


0.34 


T T ‘ T 1 
1430 1440 1450 1460 1470 1480 1490 


Let us note that the standard approach would be to search for equilibria of the 
8-folded composition of the logistic map. However this new map is a polynomial 
of degree 28 and therefore one should consider 512 roots on the interval [0,1]. 
Identifying those roots is a serious practical problem. 


Complex Multipliers, St(j) < 0. 
Complex Multipliers, Case of Equilibrium 


Let us assume that jz € {R(z) < 0} U {|z| < 1}. Note the we consider the unit disk 
because if eigenvalues are in the unit disk, we have stability without any control 
added. 

This domain may be considered as a union of the domains Mr := {|z + R| < 
R}U {|z| < 1}. If R = N/2 then choosing the polynomial map 


2 N 
r=, Oma? 
j=l 


we can guarantee that the image of the unit disc will be to the right of the line 
R(z) = —1/N. Therefore, My/2 will be included in (C\F (D))* . We illustrate this 
in Figs. 14 and 15. 
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Fig. 14 F(D), N=12 


Fig. 15 (C\F(@))* 


Burgers Map 


As an example of an application let us consider the well-known Burgers map 
Xn+1 = 0.75Xn = y, 
Yott = 1.75¥n — XnYn. 


Here different colors correspond to different initial values. The plot in Fig. 16 as 
well as several plots below are in the (x,,%,+ 1) coordinate plane. We can see that 
after adding the nonlinear control an equilibrium point is clearly revealed in Fig. 17. 
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Fig. 16 Chaos 


Fig. 17 Equilibrium, 
N=55 


Arnold’s Cat Map 


As another example, let us consider the famous Arnold’s Cat map, Fig. 18 


Xnt1 = (n+ yn) mod 1, 
(%, + 2y,) mod 1. 


Yn+1 
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Fig. 18 Chaos 


Fig. 19 T=1,N=3 


Its multipliers are real numbers, one greater than | so it is unlikely that our method 
will stabilize the equilibrium, see Fig. 19. However, it still has a regularizing effect 
on the dynamics, as can be seen in the pictures below. It would be interesting to 
understand why different orbits, corresponding to different colors, end up being 
separated by the nonlinear control, see Fig. 20. 
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Fig. 20 T = 1,N = 50 


Fig. 21 Chaos 


Ikeda 3D Map 


Let us also look at the 3D Ikeda map, Fig. 21 


XM+1= 140.9 ( cos (0.4- a) — y, sin (0.4- “s)) 
Yati = 0.9 (% sin (0.4- ° ) + Yn COS (0.4— 2) 


I+2 I+) ]? 
Znt41= V¥On- yee ye 
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0.293890899 


0.293890900 
0.5497147972 
0.5497147974 
0.5497147976 
0.5497147978 
0.5497147980 

0.5497147982 


0.293890901 


0.293890902 


0.5354416090 0.5354416100 0.5354416110 


Fig. 22 Equilibrium, N = 3 


One again notices an equilibrium, whose first few digits in the decimal expansion 
are (0.5354416, 0.5497147, 0.2938909), see Fig. 22. 


Complex Multipliers, f(z) < 0, T = 2 


We illustrate the case of cycles of length 8, i.e. N = 12, T = 8, in Figs. 23 and 24. 


Here we use F'r(z) = z(q(z,T, 1, 1)’. 
In the following few subsections we illustrate how 4-cycles become visible after 


adding the nonlinear control. 
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Fig. 23 F,(D), N = 12 


Fig. 24 (C\Fs(D)* 


Hennon Map, T=4 


Let us consider the Hennon map, Fig. 25 


X41= 1- 1.4x2 + Yn; 


Ya+l = 0.3Xn. 


Note the appearance of 4-cycles after adding a nonlinear control in Fig. 26. 
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Fig. 25 Chaos 


Fig. 26 4-cycle, N = 10 e 


T T T T T 
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—0.24 
—0.44 


—0.64 


Ikeda Map, T =4 


The 2D Ikeda map is given by the system below and is displayed in Fig. 27 and the 
4 cycle in Fig. 28 


int = 1+0.9( x c0s (0.4 — $5) — sin (0.4- 5), 
Ynt1 = 0.9 (m sin (0.4 = aa) + Yn COS (0.4 = am) ’ 
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Fig. 27 Chaos 


Lozi Map, T=4 


The Lozi map is defined by the system below and is displayed in Fig. 29 and the 
4 cycle is in Fig. 30. 


Xn+1 = 1- 1.7|x| + 0.5y, 


Ynt+1 = Xn- 
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Fig. 29 Chaos 


Fig. 30 4-cycle, N = 40 


Holmes Cubic Map, T =4 
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T T T T T T T 1 
—06 —04 —0.2 0 0.2 0.4 0.6 0.8 1 


The Holmes cubic map is defined by the system below and is displayed in Fig. 31, 


the 4 cycle in Fig. 32. 


= Yn 
—0.2x + 2.77y—y°. 
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Fig. 31 Chaos 


Fig. 32. Two 4-cycles, 
N= 18 


Chirikov Map, T =1 and T =2 


The map that describes the dynamics of the kicked rotor is given by the system 
below and is displayed in Fig. 33. Figures 34 and 35 display the dynamics of the 
systems closed by 1-cycle and 2-cycle controls correspondingly. 


Xn+yn+1.4 sin xn | 
2n 2 


+1.4 sin x, 
Yet = Dor Bets | 


Xn+1 = 2n| 
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Fig. 35 T = 2,N = 24 6 ty" 


Fig. 36 Chaos * o * * 
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Baker’s Map, T =2 


Baker’s map is defined by the system below and is displayed in Fig. 36. The systems 
closed by 2-cycle control is displayed in Fig. 37. 


Xn+1 = 2Xn = |2xn], 


5(y + [2xn]). 


Ynt+1 
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Fig. 37 T = 2,N = 20 * » 


0.24 


It is yet another example of globally non-smooth map in addition to Arnold’s Cat 
map and Chirikov map considered above. It is remarkable that in each case one can 
observe separation of initial values, thus a regularization of chaotic behavior. 


Complex Multipliers, St(j) > 0 


In this case the worst possible scenario consists of having real multipliers. Recall 
that even in the simplest system x,4; = jx, the solution x, = Cy” blows up 
exponentially and our control cannot stabilize it, since there are no oscillations 
present. Therefore, it is natural to expect that the required N will grow very fast 
as the set of multipliers M gets close to the real line. The Alexander polynomials 
illustrate this hypothesis very well. 

One piece of good news is that now we can use q(z, T, 0, T) instead of Alexander 
polynomials. There is a choice of the parameters that allows the set (13) to cover any 
part of the region C\[1, co) right to the imaginary axis as N — oo. The sets look 
like the angel wings in Fig. 39 below, or like the dragonfly wings in Fig. 41. The 
images of the unit discs are displayed on Figs. 38 and 40. The value of N = 503 
is, of course, huge, but it is much better compare to N = 20,000 for Alexander 
polynomials. The values for N are selected to highlight the difference between the 
case of negative real part multipliers, where N = 12 suffices for very negative values 
of the real part, and the positive real part multipliers, where even for a relatively 
small multipliers a very large value of N is required. 
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Fig. 38 F\(D),o = 1.2, 
t = 0.5,N = 503 


Fig. 39 (C\F\(D)* 
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Fig. 40 F\(D),o = 1.9, 
t = 0.75, N = 503 


Fig. 41 (C\F;(D)* 


—204 


—40 4 


Generalized Fejér Kernels 


In analysis there are two types of extremal non-negative polynomials introduced by 
Fejér. In a closed form they can be written as 


N+1 2 - N\2 
cos —-t sin >t 
eo (= (ae | and @”) (1) = (= (19) 


ca 
cos t — cos N+ 2 
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Graphically these two polynomials look similar; however, there is no explanation 
of that fact and no relation between these two families polynomials has been 
established. 

Surprisingly, both polynomials turned out to be involved in the problem of 
optimal stability. It was showed in [8] that the polynomials Oy ,(¢) play a central 


role in the problem of 1-cycle (equilibrium) stability, while Be Ga) are central to 
the 2-cycle stability. 

The family of complex polynomials g(z,T,2,2) that we introduced above 
generates a new family of trigonometric polynomials which contains both Fejér 
polynomials as particular cases. Denote q(z, T,2,2) by qu (z) in this section, and 


their coefficients by a Let 


jz (1), it 

eT e 
oo =3} A MH oten 
sin | 


One can check that 


1 G? (1) 
— a= - y(t), T=1,2. 
G04) ao 0) 


Letting t = (t — 2)/T we obtain the normalized version 


: — 
GY (= ae Yo (-1 tay” sin(l + G = IT). (20) 
j=l 


Some Properties of G® i 


Note that 


ay = (1+ G-IT)ay? 4, 
and that 
m(N — 2j) 
ere eae 
7 2+(N-1)T 


are double roots of Go ,(T). 
In Fig. 42 we display the plots of a few generalized Fejér polynomials (20). T = 1 
coral, T = 2 blue, T = 3 green, T = 4 red. 
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Fig. 42, Graphs of polynomials fea 1(t), T=1,2,3,4 


The classic Fejér polynomials are plotted for T = 1 and T = 2, however they 
are shifted by 2 and 2/2 correspondently because of the t substitution. These plots 
support the following 


Conjecture B The generalized Fejér polynomials 
N 
Yi(-1la\ sind + G— DT)t 


j=l 


are non-negative on [0, 7]. 


Conclusion 


We list below what we were able to prove, and what we conjecture to be true. 


(i) We were able to find the optimal coefficients for real multipliers in the case of 
an equilibrium and 2-cycle T = 1,2. 


Finding Cycles in Nonlinear Autonomous Discrete Dynamical Systems 235 


(ii) We were able to find the optimal coefficients for complex multipliers with 
(uw) < 0 in the case of an equilibrium T = 1. 

(iii) We have found coefficients for real multipliers in the case of T-cycles, T > 3, 
which vastly improve the results obtained by using Alexander polynomials. We 
conjecture that the discovered coefficients are optimal. 

(iv) We have suggested improved coefficients for the complex multipliers with 
(uw) > 0. We don’t know whether they are optimal. 

(v) We have found a family of trigonometric polynomials that contains Fejér 
polynomials as a particular case which we conjecture to be non-negative. 


We believe that the developed techniques can also be useful in the attempts 
to solve the second part of Hilbert’s 16th problem, which deals with the number 
and location of limit cycles of a planar polynomial vector field of degree n. The 
development of computational sciences made it possible to employ computers in 
this matter, thus discretizing the problem and reducing it to the problem of detecting 
of cycles of high periods in discrete settings, which is the topic of this article. 


Remembrance by Alex Stokolos 


I first met Cora in 1992 at the Miraflores Conference, Spain. That was just half 
a year after the collapse of the Soviet Union, a very difficult time in Ukraine. To 
participate in the meeting I got help from my friend Alexey Solyanik who was at 
the time a visitor at UAM and who has been a great influence on my whole career 
and life. At the banquet, Alexey and I sat at the same table as Cora and Carlos 
Segovia. I have been to many conferences and met many people since then, but I 
remember that night in Miraflores very well. Cora was just adorable, Carlos was 
great. They made fun of each other, it was an unforgettable play by two brilliant 
actors. 

Cora told me that in her junior years, American mathematicians tried to help 
young talents abroad, and she gave the example of Zygmund lecturing in Argentina. 
She tried to follow this idea, helping many Argentinian, Venezuelan, and Ukrainian 
mathematicians. 

In 1997, Cora took part in Krein’s 90th birthday anniversary conference on 
operator theory and its applications that took place in Odessa, Ukraine. There she 
met her future collaborator Dmitry Kaliuzhnyi-Verbovetskyi. The article of Dmitry 
et al is in this volume. The article of another Odessiter, Nikolai Vasilevski, is 
included in this volume as well. 

The next time I met Cora was in Williams College in 2001 while participating 
at a Special Session on “Harmonic Analysis since the Williamstown Conference 
of 1978”. It was a remarkable idea of David Cruz-Uribe and Janine Wittwer to 
celebrate that milestone symposium. The two Williamstown’s volumes conference 
proceedings have identified the development of harmonic analysis for many years. 
Some problems have been solved, many are waiting for a solution. 
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At that meeting Cora invited me to give a talk at Howard University. I went to 
Howard, gave a presentation for the seminar, met Cora’s colleagues and her husband 
Daniel. Everything was wonderful. Then more meetings, more impressions, more 
memorial moments. Last time I saw Cora was at the 10th New Mexico Analysis 
Seminar in 2007. As always, she was surrounded by people; one had to find a path 
through the crowd even to say hello. 

In 2010, news as a thunderbolt struck me - Cora passed away. I was shocked and 
depressed. Trying to pay our last tribute, my colleagues and I organized a session at 
the AMS meeting in Albuquerque in 2014; there was also an evening in memory of 
Cora during the 13th New Mexico Analysis seminar meeting. For this event I gave 
a lecture on stability of dynamical systems. I chose this topic because quite exciting 
and very subtle results in non-linear dynamics were obtained by means of classical 
harmonic and complex analysis - the subjects so close to Cora’s heart. 

Initially, the content of the talk in the preprint version was a gift to Alexey 
Solyanik. To mention the connection of Alexey to this volume allows me to point 
out that Paul Hagelstein’s article in the volume is about Solyanik’s estimates in 
harmonic analysis, the subject of Paul’s talk at SEOUL ICM 2014. Since I indirectly 
met Cora thanks to Alexey, we decided to also dedicate this article to him. 
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Smooth Analytic Functions and Model 
Subspaces 


Konstantin M. Dyakonov 


Dedicated to the memory of Cora Sadosky 


Abstract The main themes of this survey are as follows: (a) the canonical 
(Riesz—Nevanlinna) factorization in various classes of analytic functions on the 
disk that are smooth up to its boundary, and (b) model subspaces (i.e., invariant 
subspaces of the backward shift) in the Hardy spaces H? and in BMOA. It is the 
interrelationship and a peculiar cross-fertilization between the two topics that we 
wish to highlight. 
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Introduction 


Our first topic in this survey is the multiplicative structure in spaces of smooth 
analytic functions. This phrase may sound somewhat redundant, if not downright 
confusing, since every analytic function is automatically smooth (in any reasonable 
sense) on its domain. The term becomes perfectly meaningful, though, if “smooth” 
is interpreted as “smooth up to the boundary”. It is indeed the boundary smoothness 
of analytic functions that interests us here. 
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Our functions will live on the disk D := {z € C: |z| < 1}. Putting the 
smoothness issue aside (but only for a short while), let us now recall a bit of function 
theory on the disk. Suppose that f is analytic on D and not too large near the unit 
circle T := dD. Specifically, assume that f lies in some Hardy space H? with 
0 < p < ov. By definition, this means — in addition to analyticity — that 


O<r< 


1/p 
Ulla “= sup ( / yre)Pamte) es 
1 T 


if 0 < p < 00, or ||f|| yoo := supp |f| < oo if p = oo. Here and below, m denotes 
the normalized arclength measure on T. It is well known that H? functions have 
boundary values (nontangential limits) m-almost everywhere on T. We may then 
identify H? with a subspace of L? = L?(T, m) bearing in mind that the above norm, 
|| - \|we, agrees on H? with the standard L?-norm || - ||, over T (see [16, Chapter IT]). 
When 0 < p < 1, the two quantities should actually be called quasinorms rather 
than norms. 

For f as above, the function g := fll will satisfy g € LP and logg ¢€ L!. 
Moreover, these last two conditions characterize the moduli of H? functions on T. 
Now, letting u := log g and writing Pu for the harmonic extension (via the Poisson 
integral) of u from T into D, we define the outer function Og as the (essentially 
unique) analytic function on D satisfying log |O,(z)| = Pu(z). This done, we have 
O, € HP? and |O,| = 9 a.e. on T. The ratio f/O, =: @ will then be an inner 
function; that is, 0 € H® and |@| = 1 a.e. on T. Thus we arrive at the Canonical 
Factorization Theorem: the general form of an f € H? is given by f = @F, where 
is inner and F outer (so that F = O, for some ¢ as above). A further factorization 
formula for inner functions allows us to express 6 canonically in terms of its zeros 
{a,} (these are only required to satisfy )°,,(1 — |a,|) < oo) and a certain singular 
measure jz on T; see [16, Chapter II]. In summary, the original function f € H? 
is fully described by the parameters gy, {a,} and jz that emerge; and any choice of 
parameters gives rise to an f € A? via factorization. 

The terms “inner function” and “outer function” were coined by Beurling. 
Why did he call them that? An amusing, but rather controversial, explanation I 
have heard is that the identity f = OF, when written in this specific form, has 
@ (the “inner factor’) inside and F (the “outer factor”) outside. Observe that in 
some noncommutative generalizations, which we do not touch upon, the order may 
become crucial; and yes, it should be @F rather than F@. 

While quite a bit of modern 1-D complex analysis has evolved in an attempt 
to extend the H? theory to larger analytic spaces, one also feels tempted to look 
at smaller (nicer) classes, in particular, at those populated by smooth analytic 
functions. Here, the good news is that the canonical factorization theorem applies. 
The bad news is, however, that the parameters cannot be chosen freely. Indeed, most 
inner functions — actually, all the “interesting” (i.e., nonrational) ones — are highly 
oscillatory, hence discontinuous, at some points of T. Consequently, the product 
OF may only be smooth on T if the outer factor, F, is good enough and kills the 
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singularities of the (bad) inner factor, 6. To find an explicit quantitative expression 
of this interplay, for a given “smooth analytic space’, is therefore one problem to be 
dealt with. 

Our second topic is the model subspaces, alias star-invariant subspaces, in H? 
and in BMOA := BMON H!, where BMO = BMO(T) is the space of functions of 
bounded mean oscillation on T (see [16, Chapter VI]). In H?, the model subspace 
Kg generated by an inner function @ is, by definition, the orthogonal complement of 
the shift-invariant subspace 0H. Thus, 


Ko (= Kj) := H’ © OH”. (1) 


It is a reproducing kernel Hilbert space, whose kernel function k, associated with a 
point z € D is given by 


1-8) 


k-(¢) = I — Zz 


This last function is therefore in Kg for every z, and every f € Kg satisfies 


boned 


f= [ fk dm(f), ze 


It is straightforward to verify that Ke = H?N 6 R2, and we further define K (the 
H?-analogue of Kg) by putting 


Ke := H? 10 HP, l<p<o, 


where Hi := {f € H? : f(0) = 0} and the bar denotes complex conjugation. For 
smaller p’s, a more reasonable definition appears to be 


Ki, := closypKo, 0<p<l. 


These subspaces play a crucial role in the Sz.-Nagy—Foias operator model (see [20]), 
which accounts for the terminology. Now, the term “star-invariant” means invariant 
under the backward shift operator f +> (f — f(0))/z, and it follows from Beurling’s 
theorem (see [16, Chapter II]) that the general form of a closed and nontrivial star- 
invariant subspace in H? is indeed given by (1), with @ inner. A similar fact is true 
for H? when | < p < oo. 

Finally, we put 


Kuo 2= Ko OQ BMOA. 
When equipped with the BMO-norm || - ||x, K»¢ becomes a star-invariant subspace 


of BMOA,; in fact, it is the annihilator in BMOA of the shift-invariant subspace 0H : 
in H'. Of course, Kx contains Kf° and is contained in every Ki, withO <p<o. 
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While each of the two topics just mentioned has received quite a bit of attention 
in its own right, the intimate interconnection between them does not seem to have 
been noticed (until recently) or explored in any detail. It is precisely the systematic 
exploitation of this interrelationship, perhaps a kind of duality, between the two 
subjects that is characteristic of our approach. In fact, the three stories told in the 
next three sections are intended to show that results and methods pertaining to one 
of our themes cast new light on the other, and vice versa. 

Before moving any further, we need to recall the notions of Toeplitz and Hankel 
operators, since these will be crucial in what follows. We let P; and P_ denote the 


orthogonal projections from L? onto H? and onto He, respectively. Thus, 


(PLF)@):= D> F()z" and (P_F)(2) = DO F)z", 


n>0 n<O 


where F(n) =], \e"dm(e ) is the nth Fourier coefficient of F. These operators 
are then extended to L? with 1 < p < o (in which case they become bounded 
projections onto H? and Hf, the classical M. Riesz theorem tells us) and furthermore 
to L! (even though P;(L') ¢ L!). Next, given a measurable function y on T, we 
write 


Tyf = P+(Wf) and Ayf := P_(Wf), 


whenever f € H' and wf € L!. The mapping Ty (resp., Hy) is called the Toeplitz 
(resp., Hankel) operator with symbol w. 

In the special case where yw is analytic (i.e., ~ € H!'), Ty reduces to the 
multiplication map f + fw, defined at least on H™. The Toeplitz operators with 
symbols in H! are said to be coanalytic. It is also worth mentioning that the model 
subspace ie (where p > 1) or Kxg, with @ an inner function, is precisely the kernel 
of the coanalytic Toeplitz operator Tj acting on H? or BMOA. 

Because Toeplitz and Hankel operators were among Cora Sadosky’s best beloved 
mathematical creatures, their appearance in this survey seems to be appropriate (and 
is, anyway, far from incidental to the subject matter). 

We conclude this introduction with a brief outline of the rest of the paper. 
In sections “Factorization in Lipschitz—Zygmund Spaces” and “Factorization in 
Dirichlet-Type Spaces”, we look at certain smooth analytic spaces X and seek to 
characterize the pairs (f, 9), with f € X and 6 inner, which satisfy 


fOEX, (2) 
Sometimes it is more natural to replace (2) by 


fok €X forall kEN, (3) 
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and we are led to consider some other related conditions as well. In section “‘Fac- 
torization in Lipschitz—Zygmund Spaces”, the role of X is played by the analytic 
Lipschitz—Zygmund spaces A® (see the beginning of that section for definitions), 
and the pairs (f,0) with property (3) are then explicitly described by a certain 
smallness condition, to be imposed on |f| near the singularities of 6. Furthermore, 
the same smallness condition ensures that the multiplication operator g +> fg acts 
nicely on the model space K%,, or perhaps on K%,,, with n suitably large, by improving 
integrability properties of the functions therein. For instance, given 1 < p < g < oo 
and a = p_! —q7!, we prove that multiplication by a function f € A“ maps Kk 
into H? if and only if it maps 6 into A® (so that (2) holds with X¥ = A“). The case 
of smaller p’s and larger w’s leads to a minor complication involving (3) in place 
of (2), and K%,, in place of K%. 

In section “Factorization in Dirichlet-Type Spaces”, our space X is chosen 
from among the so-called Dirichlet-type spaces. Each of these is formed by the 
functions f €¢ H* whose coefficient sequence, Fi (n)}, lies in a certain weighted £7. 
An important special case is the classical Dirichlet space D, the set of analytic 
functions f on D whose derivative, f’, is square integrable over D with respect 
to the normalized area measure A; the (semi)norm ||f||p is then defined to be 


( folk |?dA) He Among other things we recover, for f € D and @ inner, the identity 


Wal = Wa + [ IFI?|6"lam, (4) 


which forms part of Carleson’s celebrated formula from [4]. Moreover, we obtain 
similar — but more sophisticated — formulas for general Dirichlet-type spaces; these 
yield the smallness conditions on f (in relation to @) that are responsible for the 
interplay between the two factors in (2), for the current choices of X¥. When X = D, 
the corresponding smallness condition reads f; |f|?|0’|dm < oo, as readily seen 
from (4). Our approach to (4) is based on the fact that the quantity ||f0||p coincides 
with the Hilbert-Schmidt norm of the Hankel operator Hyg acting from H’ to i 
(and similarly for f in place of f@). Now let {g,} be an orthonormal basis in the 
model subspace Ky. Since H7 = 6H? @ Ko, the family {02:39 U {gn} is an 
orthonormal basis in H*, and we may use it to compute the Hilbert-Schmidt norm 
of Hyg. This gives 


2 
’ 
2 


(71% = > |Ha@|, + | eran 
k>0 n 


and a further calculation shows that the two sums above reduce to the two terms on 
the right-hand side of (4). A modification of the same technique allows us to handle 
the case of a generic Dirichlet-type space. 

In section “Model Subspaces in BMOA”, we consider coanalytic Toeplitz 
operators on the model subspace K, 9, and we obtain a criterion for such an operator 
to act boundedly from K,.9 to a given analytic space X, under certain assumptions 
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on the latter. Precisely speaking, the spaces X that arise here naturally are those 
which enjoy the K-property of Havin. In other words, it will be assumed that every 
Toeplitz operator 7; with h ¢€ H® maps X boundedly into itself and satisfies 
Tr llx+x < const - ||A||,9. This property was introduced by Havin in [17], where 
he also verified it for a number of smooth analytic spaces. (It was further observed 
in [17] that every space X with the K-property admits division by inner factors: 
whenever f € X and / is an inner function such that f/J € H', it follows 
that f/I € X.) Now, the appearance of the K-property in connection with model 
subspaces of BMOA seems to reveal yet another link between the two topics of 
concern. 

The content of section “Factorization in Lipschitz-Zygmund Spaces” is essen- 
tially borrowed from the author’s papers [7, 8], while sections “Factorization in 
Dirichlet-Type Spaces” and “Model Subspaces in BMOA” are based on [10] and 
[12], respectively. It seems that a bit of self-plagiarism is unavoidable — and 
hopefully pardonable — under the circumstances. 


Factorization in Lipschitz—Zygmund Spaces 


This section deals with the Lipschitz-Zygmund spaces A* = A®(T) and their 
analytic subspaces A“. For 0 < a < oo, the space A® is defined as the set of 
all (complex-valued) functions f € C(T) that satisfy 


if loo = O(|h|"), = -hER, (5) 


where || - ||oo is the sup-norm on T, n is an integer with n > a, and Aj denotes the 
nth order difference operator with step h. (As usual, the difference operators At are 
defined by induction: one puts (A}f)(¢) := f(e"f) —f (0) and Aff := ALA‘ 'f,) 
It is well known that property (5) does not depend on the choice of n, as long as 
n > a, except possibly for the constant in the O-condition. 

The corresponding analytic subspaces are 


A® := A°N H®™, 0 <a <oo. 


Equivalently, by a theorem essentially due to Hardy and Littlewood, A® is formed 
by those holomorphic functions f on D which obey the condition 


YO @! =O(a-|)*"), ze D, 


for some (and then every) integer n with n > a; here f is the nth order derivative 
of f. The spaces A® and A® are then normed in a natural way. 

The main result of this section is Theorem 2.1 below, which characterizes the 
pairs (f,@), with f © A® and @ inner, such that f admits multiplication and/or 
division by every power of 6 in A®. The characterization involves an explicit 
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quantitative condition saying that |f(z)| must decay at a certain rate as z approaches 
the boundary along the sublevel set 


Q(0, 6) := {zED: |A(z)| < e} (6) 


with 0 < ¢ < 1. Moreover, it turns out that the same decay condition provides a 
criterion for the multiplication operator Ty : g +> fg to map the model subspace Kt, 
continuously into H%, once the exponents are related appropriately. 


Theorem 2.1 Suppose that 0 < p < oo, max(1,p) <q <«0,a =p !—q"|, and 


nis an integer with np > 1. Assume also that f € A® and @ is an inner function. The 
following conditions are equivalent: 


(i) fO € A® forallk EN. 

(ii) fO" © AY. 
(iii) The multiplication operator Ty maps Ki boundedly into H%. 
(iv) For some (or every) € € (0, 1), one has 


lf(z)| = O( = |z))*) for ze Qe). (7) 


(v) fO* € AY for allk EN. 
(vi) f0" € A®. 


It should be noted that the set (6, €) hits T precisely at those points which are 
singular for 6. Thus, (7) tells us how strongly the good factor f must vanish on 
the bad set of the problematic (nonsmooth) factor 0 in order that the products in 
question be appropriately smooth. 

Postponing the proof for a while, we first establish a few preliminary facts to lean 
upon. To begin with, we recall the Duren—Romberg-Shields theorem (see [6]) which 
allows us to identify A® with the dual of the Hardy space H", where r = (1 + a)!, 
under the pairing 


(gv) = [ evan. 


For a given w € A®, the integral above is well defined at least when g € H™, and 
we have 


lg. W)| < callellllW lla 


with some constant cy, > 0. Moreover, the norm of the functional induced by y on 
A’ is actually comparable to || W|| qe. 

The next three lemmas exploit this duality relation. The first of these was 
established by Havin in [17]; we also cite Shamoyan [24] in connection with part 
(b) below. 
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Lemma 2.2 Let0 <a <o. 


(a) [fh € H®, then the Toeplitz operator T;, maps the space A* boundedly into 
itself, with norm at most const - ||h||oo. 

(b) If f € H! and @ is an inner function such that f8 € A%, then f € A®* and 
IIfllax < const: ||fO||2. 


The constants are allowed to depend only on a. 


In Havin’s terminology, statements (a) and (b) can be rephrased by saying that 
A® has the K-property and the (weaker) f-property, respectively. To prove (a), one 
notes that 7; is the adjoint of the multiplication operator T, : g +» gh, which is 
obviously bounded on H’ with norm at most ||/||.9. To deduce (b) from (a), observe 
that f = T;(f0). 


Lemma 2.3 Suppose that 0 < p < 0, max(1,p) < q < 00, anda = p"!'—q"!. If 
f € A®, then the Hankel operator H,, defined by 


Hyg = P_(fg), g€H®, 
can be extended to a bounded linear operator mapping H? into Hi, 


Proof Putr = (1+ a)~! and qd’ = q/(q—1). Given g € H™ andhe Ht, we have 


[ Fgh an 


S callfllacllgAlly S callfllac lisp llAlly- 


| ere) ham [P-e-han 7 


Here, the last two inequalities rely on the Duren—Romberg-—Shields duality theorem 
and on Holder’s inequality. Taking the supremum over the unit-norm functions h in 


Ht, we obtain 
AFEllq < callfllacligily. 
which proves the required result. Oo 


1 1 


Lemma 2.4 Suppose that 0 < p < oo, max(1,p) < q < 00, anda =p —q™. 
Further, let f € H? and let 0 be an inner function. If P_(f0) € A“, then the operator 
T,| zoo can be extended to a bounded linear operator acting from Ki to H4, 

6 


Proof Given g € K®, put h := Zg6 (so that h € H™®) and w := P_(f6). The 
elementary identity 


P,F=7P_(F), FeéeL’, 
shows that Tg = 7H,,h. Using Lemma 2.3, we get 


I778llq = HyAllg < const: ||YllaellAllp = const: [lv lla«lgllp. 


which completes the proof. oO 
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As a final preliminary result, we list some facts about the so-called Carleson 
curves associated with an inner function; see [16, Chapter VIII] for a proof. 


Lemma 2.5 Given an inner function 0 and a number ¢ € (0,1), there exists a 
countable (possibly finite) system T., = T,(0) of simple closed rectifiable curves in 
) U T with the following properties. 


(a) The interiors of the curves in T’, are pairwise disjoint; the intersection of each 
of these curves with the circle T has zero length. 

(b) One has n < |O| < eon Tl, ND for some positive n = n(é). 

(c) The arclength |\dz| on T, ND is a Carleson measure, i.e, H! C L'(T., |dz|); 
moreover, the norm of the corresponding embedding operator is bounded by a 
constant N(¢) depending only on é. 

(d) For every F € H', the equality 


holds true, provided that the curves in the family !’, are oriented appropriately. 
Now we are in a position to prove our main result in this section. 
Proof of Theorem 2.1 The implications (i) == > (ii) and (v) => (vi) being obvious, 
our plan is to show that (ii) > (ili) (iv) > (i) (v) and also that 
(vi) => (iii). 
(ii) => (ili). Write u := 6” and let g € K°°. Note that 


fg = Trg + Hyg. (8) 
Since f € A%, Lemma 2.3 tells us that 
AF8lla S Callfllacliglly- 


Now, since fu € A®% by (ii), it follows that P_(fu) € A® (indeed, the operators P+ 
and P_ are known to map A® into itself), and Lemma 2.4 gives 


II7F8llq S CallP-(Fu)I|a2 gly. 
The last two inequalities, together with (8), imply 


lifgllq < const - [Igllp. 


where the constant does not depend on g. Obviously, 


IZglla = Wella = lifslla. 


and since KP? is dense in K? 


P, we conclude that 7; is a bounded operator from K? 
to H%, 
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(iii) ==> (iv). Fix z € D and consider the reproducing kernel k, (for K;), given by 


1 — 6(z)0(6) 
1-z 


Since k, € K5°, it follows that k? € K3?(C K’,,); indeed, 


k($) = 


Therefore, by (iii), 


I[fK'llq < const - ||k? lp. (9) 


In order to derive further information from this inequality, we now estimate its right- 
hand side from above, and the left-hand side from below. The elementary estimate 


dm(¢) Cy 
(eens. 0 
shows that 
— n 1/p 
» _(f)i-9@eo|" 
Ike llp = ( Ll fee im) 


(10) 


\/p 
<2" (/ dm(C) ) e const 
cio)” (Lea lgyrne 
since np > 1. 


Now let F stand for the outer factor of f. Using the Cauchy integral formula, 
=n ig 


we get 
1/q 
lly = ( [ Fol | S| an @) 


(1 —9@a(o))" dm(ey |" 
dz)" 12 


(= le@ry\ "4 (= la@))" 
= (irr eon)" = |F Olq- ype 


FAG) 
T 


(11) 


C= |@@)I)” 


> const - |f(z)| C—|gypna 
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In view of (10) and (11), inequality (9) now yields 
1+ — |9@))" S const - (1 — |z[)/?-/4 = const - (1 — |zl)*, 
the constant being independent of z. Hence, for 0 < ¢ < 1, we have 
If(z)| < const - (1 — e)™"( — [z|)* 
whenever z € Q2(6, €), so that (iv) holds true. 


(iv) == (i). We begin by showing that if (iv) is fulfilled with some e € (0, 1), 
then f@ € A®. Since 


f0 = Tof + Hof 


and Taf € A® (recall Lemma 2.2), it suffices to check that Haf ¢ A*. To this end, we 
take an arbitrary function g € H>° with ||g||, = 1, where r = (1 + a)~', and verify 
that the integrals Jp(Aaf)g dm are bounded in modulus by a constant independent 
of g. This will mean that the function zH@f generates a continuous linear functional 
on H’, and hence lies in A®. Writing g; := g/z and using the Carleson curves 
I’, = [',(@) as described in Lemma 2.5, we obtain 


news = ie = eee 


_ [ Ba fai Alle" lei” 
2n1 


= = 55 r |0| 
Because g, is a unit-norm function in H’, it follows easily that |g1(z)|" < (—|z|)7, 
whence 


|dz|. 


la@l'" <d—|2) Oo?" = (|e), = ze D. 


Plugging this into the preceding estimate and recalling that |9| > n(e) on P, ND, 
we find that 


1 lf (z)| 
A ; "ld. (12 
| [ane nl < oO ( x», a red if lgil’ |dz| (12) 


Since [’, ND is contained in Q(6, €), the supremum in (12) is finite by virtue of (iv). 
Also, 


[ lel” Idzl < N(e)- i aii we): 
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Taking this into account, we deduce from (12) that 


— CN&) 
~ = Bane)’ 


sup} [ctsheam : g € AO, |lgll- = 1 


where C is the constant coming from the O-condition in (iv). This means that Haf € 
A®, and hence f6 € A®, 
_ Replacing 6 by 6* and « by e* in the above argument, we similarly verify that 
fO* € A® for every k EN. 
(i) => (v). Assuming (i), we prove first that f9 € A®, or equivalently, that 


(0) (2) = O(1=|z))"™)_ as |e] 1. t2) 


For z € D and almost all ¢ € T, we have the elementary identity 


arth(6) = (86) — O@)"* + YOKE) 


k=0 
where 
vz) = iy" ( : ‘ert 
Therefore, 
is £(6)O0(6) SOMONE | 
(F8)(e) ao ent = 3 eo eae 
ont fiom (OO - Bip La “) 
= [0 (2) a mae f oars 
n! 
= Fy [P9822 ue +P e09- (Fy) (, 
where 


SS ). 


2.00) = ( ae 


In view of (i), f0" € A® for k = 0,...,7, so that T»-«f € A®, which implies 
that 


(n) 
(Tf) @ =O = le") as [el > 
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The functions ¢(z) are bounded in D, and to prove (13) it remains to verify that 


fe (0")0) 0,04 = 011 |e") as elo. 4) 


2ni = 


Denote the integral on the left-hand side by /,,(z). Since ®, € H™, it follows that 


Mn(2)| = | [0996-506 dm) < cab GON ele 


here, as before, r = (1 + a)~!. Because n > a, we have (n + 1)r > 1 and 


|®,|, <2"! (/ dm(¢) )" 2 const _ __ const 
a i Sr a a © a) ae 4) 


where the constant does not depend on z. Consequently, 


[,(z)| < const - ||P_(f@" )llae(1 — [z|)°™. 
Since 
|P_(f9 lax < CallfO IIax < 00 


by virtue of (i), the estimate (14) is thereby established. 
Thus, we have proved the implication 


(f € A”) & (i) => fO EA". 


Applying this inductively to f0, f07, etc., in place of f, we eventually deduce from 
(i) that f0" € A® for eachk € N. 

(vi) ==> (ili). Write u := 6” and suppose that g € K°°. Then gu € H5°, and 
hence 


fig = P_(fig) = Hyg. 
Therefore, 
llfglla = Ift@glla = Ai8lla S callfulla«ligllp. (15) 


where the last inequality is due to Lemma 2.3. The quantity ||fu||,« is finite in view 
of (vi), and (15) tells us that 


llfgllq < const ||g|lp 


with a constant independent of g. Thus, the multiplication operator Ty : g +> fg 
maps K? boundedly into H%, as required. oO 
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If we wish to restrict ourselves to the issue of multiplying or dividing a function 
f € A® by an inner function @ (and its powers), leaving out the model subspace part, 
we may state the result in a more concise form as follows. 


Proposition 2.6 Suppose that0 < a < co,n € N, andn > a. Given f € A® and 
an inner function 0, the four statements below are equivalent. 


(i) f0" € A®. 
(ii) fO" © AY. 
(iii) fOk € A® for allk € Z. 
(iv) Condition (7) holds for some (or every) & € (0, 1). 


To prove this, it suffices to choose exponents p and q (once @ and n are given) so 
as to make the hypotheses of Theorem 2.1 true, and then invoke the theorem. 


Remarks 1. An alternative route to Proposition 2.6 (but not to Theorem 2.1 in 
its entirety) via the pseudoanalytic extension method was found by Dyn’kin 
[15]. A similar technique was later used by the author in [11] to completely 
characterize the functions in A*, 0 < @ < 1, and in more general Lipschitz- 
type spaces, in terms of their moduli. (In particular, some equivalent forms of the 
crucial condition (7) came out as a corollary.) Subsequently, Pavlovié [21] gave 
a more elementary proof of that result from [11]. 

2. Some of the conditions in Theorem 2.1 and Proposition 2.6 would become 
simpler if we could take n = 1. This can be done if 1 < p < oo in Theorem 2.1, 
or if 0 < a < 1 in Proposition 2.6, but not in the general case. Indeed, it follows 
from Shirokov’s work (see [28, 29]) that for each a > 1, one can find f € A® 
and a Blaschke product @ such that f/@ € A®%, but fO ¢ A®. This means, in 
particular, that conditions (i) and (ii) in Proposition 2.6 are no longer equivalent 
when a > 1 andn = 1. The equivalence does hold under certain additional 
assumptions, though; these are likewise discussed in [28, 29]. See also [9, 13] for 
an alternative study of this phenomenon. 

3. Givena € (0, oo) \Z, suppose that f € A®% and @ is an inner function. Comparing 
our Proposition 2.6 with Shirokov’s earlier results (see [27—29]), one infers that 
condition (7) holds if and only if 


m(a(@)) = 0 & If(f)| = O (aor) for € € T \ o(9), (16) 


where o(6) is the set of boundary singularities for 6. The equivalence 
between (7) and (16) was also verified directly in [7, Section 2]. 

4. Theorem 2.1 and Proposition 2.6 remain valid in the case a = 0 (withn = 1 and 
1 < p = q < 0), provided that the spaces A° and A® are taken to be BMO and 
BMOA, respectively. This convention might be justified by the duality relations 
A® = (H'/(+))* and BMOA = (H!)*. The BMO versions of the above results 
are discussed in more detail in [7, Section 5]. 

5. In [13], we also considered the algebra H& := {f : f™ € H™},n © N, in place 
of A®, and we came up with an analogue of Proposition 2.6 in that context. 
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Factorization in Dirichlet-Type Spaces 


For a sequence w = {w;}?2, of nonnegative numbers, the corresponding Dirichlet- 
type space D,, is formed by those functions f € H? for which the quantity 


6a 1/2 
Ill = (> wor (17) 
k=1 


is finite. The case wz, = k corresponds to the classical Dirichlet space D(= Dx), 
the set of all functions f € H? with 


1/2 
flo := (/ Vora) <0 


(here A is the normalized area measure on D), and we have || - ||p = || - llga- 
We begin by establishing a certain orthogonality relation involving Toeplitz 
operators on Dirichlet-type spaces. 


Theorem 3.1 Given numbers 0 < w; < w. < ..., let w = {wg}po, and let 
VY = {Vehpc, be the sequence defined by 


yi=wWi, Ve =We-We-1 (k= 2,3,...). (18) 


Suppose that F € H?, @ is an inner function, and {g,\ is an orthonormal basis in Ko. 
If ® := 2TgF and hy := zTq(F gn), then 


Fil, = Wl, + SS Wall? (19) 


(the definition of || - ||, being similar to (17) above). 


To keep on the safe side, we remark that sequences with unspecified index sets, 
which we occasionally employ, are allowed to be finite (and sometimes empty). In 
particular, the orthonormal basis {g,} in Theorem 3.1 will be finite if and only if @ 
is a finite Blaschke product. 

The proof will make use of the notion of a Hilbert-Schmidt operator. Recall that, 
given two separable Hilbert spaces H, and Hp, a linear operator T : H, — Hz is 
said to be Hilbert—-Schmidt if the quantity 


1/2 
IIT lle. = (= ie) 
n 


is finite for some (or each) orthonormal basis {e,} of H,. It is well known — and 
easily shown — that this quantity does not actually depend on the choice of {e,} and 
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is therefore well defined. The set of all Hilbert-Schmidt operators from H to H2 is 
denoted by G2(H, H2). 

Also, we need a lemma that relates Hilbert-Schmidt operators to Dirichlet-type 
spaces. We state and prove it now, before proceeding with the proof of Theorem 3.1. 


Lemma 3.2 Let F € H*. Suppose that w = {w,}, and y = {yx}, are two 
sequences of nonnegative numbers related by 


w= Se CHT (20) 
k=1 


Finally, consider the multiplier map M,, acting by the rule 
Co Co 
M, (> oa =) Mraz, = z€T (21) 
k=l k=1 


(defined initially on the set of antianalytic trigonometric polynomials >, az"). 


Then the operator M,H; belongs (or has an extension belonging) to G>(H’, H3) 
if and only if F € Dy. Moreover, 


|M)HFlle. = ||Fllw- (22) 


Proof Since {z"}°2., is an orthonormal basis in H?, we have 


CO 
IM, Hell, = > My Hee" 3. (23) 
n=0 
where || - ||2 is the usual L?-norm. Letting a, := F(n), we find that 
[o) 
gz" = aa 
k=1 


whence 


Co 
M, Hp" = pa VS Vani 
k=1 
and, by the Parseval identity, 


[o.2) 
My He2"13 = > yelansel?. 
k=1 
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Plugging this into (23) and recalling (20), we obtain 


co 0 ioe) Jj 
MHS, = >_> >> yelansel? = >- lal? > v 
j= k=1 


n=0 k=1 j=1 
[o.@) 

=) wilaj? = IF, 
j=l 


which proves (22) and the lemma. oO 


Proof of Theorem 3.1 Let My, be the multiplier map defined by (21). From 
Lemma 3.2 we know that 


IF ll, = IM) Hells,- (24) 


Consider the functions G, defined (a.e. on T) by G, := 78,8. Since {g,} is an 
orthonormal basis in Kg, the same is true for {G,} (indeed, the map f t> zf0 1S 
an antilinear isometry of Kg onto itself). Furthermore, since H? = 0H’ © Kg, the 
family {0z"}°°, U {G,} forms an orthonormal basis in H?, and we may use it to 
compute the Hilbert-Schmidt norm in (24). In this way we obtain 


lo. 2) 
IM) Hells, = >— My ArO2")I3 + D0 My HpGall3 = Si + So, (25) 


n=0 n 


where S; and S, denote the two preceding sums, in the same order. The elementary 
identity 


P_y=2Pi@), gel’, (26) 
yields 
P_(F0) = 2P+(F8) = ©, 
whence 
H;(0z") = P_(F0z") = P_(P_(F6) - z") 
=P(O,) joer 
Thus, 


[o.@) 
Sy, = 0 |M,Ag2"|3 = IM, Agi, = loll, (27) 


G2 yw? 
n=0 


where the last equality relies on Lemma 3.2. 
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Another application of (26) gives 


HrG, = P_(F28,0) = 2P+(F8n9) = In, 


and so 
CO pod 2 lo, 
IM, HeGull3 = Ye |p| = D> vallinO I? = Wall 
k=l k=l 
Summing over n, we get 
Sp = > |M,AzGall3 =~ IIhall7. (28) 


Finally, we plug the identities coming from (27) and (28) into (25). Together 
with (24), this yields the required formula (19). oO 


As a consequence of Theorem 3.1, we now deduce a result of Korenblum and 
Faivyshevskii concerning the action of certain Toeplitz operators on Dirichlet-type 
spaces. (In all fairness, their original theorem in [19] gives a bit more than our 
Corollary 3.3 below. Alternative routes to that result can be found in [18] and [22].) 
To state it, we need a minor modification of the ||-||,, norm. Namely, given a sequence 
v = {vU,}°2, of positive numbers and a holomorphic function f(z) = yf (n)z" 
on D, we put 


6 1/2 
Ifllvo = (> »,Fon?) 


n=0 
(note that the value n = 0 is now included). 


Corollary 3.3 Let v = {vn}, be a nondecreasing sequence of positive numbers, 
and let @ be an inner function. Then, for every f, g € H”, we have 


Il Taf llv.o = ILFllv.0 (29) 
and 
Ilgllv.o < Ilg4llvo. (30) 
Proof Put F := zf and define ® as in Theorem 3.1, so that 
d= gk = 2Tef. 
For n = 1,2,..., let W, = Up; and w = {w,}°2,. Theorem 3.1 implies that 


P|]. < Flv. Observing that ||®||,, = ||Tafllvo and ||F lly = [fllvo. we arrive 
at (29). To prove (30), it suffices to apply (29) with f = g@. oO 
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The next fact is likewise a straightforward consequence of Theorem 3.1. 


Theorem 3.4 Let w = {w;,}f2, be a nondecreasing sequence with w, = 0, and let 
y = {yc}, be defined by (18). If f € H’, 6 is an inner function, and {g,} is an 
orthonormal basis in Kg, then 


IFAs, = WAI, + 2 lizfenll;. (31) 


Proof Put F := f6, and define ® and h,, as in Theorem 3.1. We have then 


® = Ta (f0) = Tf =f —f(0), 


whence || ®|,, = ||f|lw. Also, 


hy, = 2T7(fO8n) = S8n- 


The formula (19) therefore reduces to (31), and the proof is complete. oO 


In some special cases, Theorem 3.4 can be used to derive a more explicit form of 
the (nonnegative) “discrepancy term” 


RF, 9) := [fle — Ill? (32) 


One such case is pointed out in Theorem 3.5 below. Before stating the result, we 
need to recall some basic facts about angular derivatives. 

Given a function g € H®™ with ||@||~o = 1, we say that g has an angular 
derivative (in the sense of Carathéodory) at a point ¢ € T if both g and g’ have 
nontangential limits at ¢, the former of these being of modulus 1. (The two limits 
are then denoted by g(¢) and g’(¢), respectively.) The classical Julia-Carathéodory 
theorem (see [2, Chapter VI], [3, Chapter I] or [23, Chapter VI]) asserts that this 
happens if and only if 


( 2 
lim ‘nf OE <o. (33) 
et b= |g? 


And if (33) holds, the theorem tells us also that y’(¢) coincides with the limit of the 
difference quotient 


g(z) — 9S) 


r= 


as z — ¢ nontangentially. Moreover, |g’(¢)| will then agree with the value of the 
(unrestricted) liminf in (33), and this remains true if liminf is replaced by the 
corresponding nontangential limit. 
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Finally, if @ = BS is an inner function (with B a Blaschke product and S singular), 
then 


d 
eOl= Dip et 2 | on i 


where {a;} is the zero sequence of B and yj is the singular measure associated with S. 
This formula can be found in [1]; it holds for every point ¢ of T, with the convention 
that |0’(¢)| = oo whenever @ fails to possess an angular derivative at ¢. 


Theorem 3.5 Let o be a positive Borel measure on [0, 1] with tio y x°do (x) < 00. 
Put 
Ve = / x*do(x), k=1,2,..., 
[0,1] 


and define the sequence w = {w,}%., by (20). If f € H® and 0 is an inner function, 


then 


n=1 


sich 


Fall? = WI + [ dm(t) I, PCO do(r). 35) 


Here the value of (1 — |6(rt)|*)/(1 — r*) at r = 1 is interpreted as \0'(£)|, the 
modulus of the angular derivative of 0 at ¢. 


The proof will rely on Theorem 3.4 and on the following lemma. 


Lemma 3.6 Let 6 be an inner function, and let {g,} be an orthonormal basis in Ko. 
Then 


7” 2 
Yie@r= eer ep. (36) 
T= iP 


Furthermore, if € € T is a point at which the limits lim,1- g(r) =: gn(€) exist 
for all n, then 


X len(E)I? = 10'(6)I. (37) 


To prove the lemma, consider the reproducing kernel 


1— 6(z)0(w) 
1-—Zw 


k,(w) = 


of Kg and use Parseval’s identity to get 


1 — |A(z)|? 
Y len? = Men? = Wel = (@) = ESE 
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for z € D. This yields (36), which in turn implies (37) upon putting z = rf and 
passing to the limit as r > 17. 


Proof of Theorem 3.5 We may assume that f € D,,, since otherwise both sides 
of (35) equal oo. By Theorem 3.4, the “discrepancy term” (32) is given by 


Rv (f.6) = >> lIefenll?. (38) 


where y = {yg};2, and {g,} is some (no matter which) orthonormal basis in Kg. 
This said, we proceed by considering two special cases. 


Case I: o has no atom at 1. We may think of the disk 


V={rf: re [0,1),¢¢€T} 


as of a measure space endowed with the product measure 0 x m =: v. The 
monomials z‘ (k = 1,2,...) are then mutually orthogonal in L?(D) and have 
norms ,/Yx. Therefore, for a function A(z) = )-7-., h(k)z* in zH', we have 


Ales = > nih? = |All. 
Applying this to h, := zfgn gives 
Vol? = Willacy = f amt) [. Pip rO)Plen(re)2do(r). 
Consequently, in view of (38), 
Ro.) = Dial = fame [PCP DlentrOPdo. G9) 
By Lemma 3.6, 


0 
D lente OF = ae 


and so (39) reduces to 


ae li 


Rolf.8) = ff dnt I PURO do(r), 


which proves (35). 
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Case 2: o is the unit point mass at |. In this case, we have y, = | and wz, = k, 


so that || - ||, = || - 2 on zH?, and || - ||, = || - ||p. Therefore, we can rewrite (38) 
in the form 


W/O — UF = Yo Usten 3 = f PCP lenlG Pac 
Combining this with (37), we finally obtain 


fla — Illa = [veri iam), (40) 


which coincides with (35) under the current hypothesis on o. 


The general case being a combination of Cases 1 and 2, the required result 
follows. Oo 


Remark Recalling the identity (34) and plugging it into (40), we find that 


2. 2 lai? diL() 
roll = Wl + [OP Lie +2 [AE Jam) can 


(here, as before, {aj} is the zero sequence of @, and yx is the associated singular 
measure). This was established by Carleson in [4]. In fact, the formula given there 
is a combination of (41) and an explicit expression for the Dirichlet integral ||f||Z 
of an outer function /. 


Model Subspaces in BMOA 


It has been noticed that various smoothness properties of an inner function 0, if 
available, tend to be inherited (typically, in a weaker form) by functions in Ke . This 
phenomenon becomes especially pronounced when passing from 0 to 


Kup = Kj ABMOA, 


the star-invariant subspace of BMOA, in which case no loss of smoothness usually 
occurs. (Of course, the smoothness property in question should not be too strong 
— it should not even imply continuity — if we want a nontrivial inner function to 
have it.) A result to that effect will appear as Corollary 4.4 below; we shall deduce it 
from a more general theorem concerning the action of a coanalytic Toeplitz operator 
T;, with g € H', on Kx. However, the very meaning of the expression Tf (with 
Ff € Kxo) is not immediately clear, since the product fg need not be integrable. The 
following proposition will clarify the situation. 
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Proposition 4.1 Given f € Kxo and g € H', there exists a function ® € No<p<1H? 
such that 


\|7z,f — Pll) > 0 
for every p € (0, 1) and every sequence {g,} C H? with ||gn — g||, > 0. 


This (obviously unique) function ® is then taken to be 77f, the image of f under 
the Toeplitz operator T3. 

The proof relies on the following lemma due to Cohn (see Lemma 3.2 in [5, 
p.731]), which in turn results from an application of the (H!, BMOA) duality. 


Lemma 4.2 Let @ be inner, and let f € Kxo. Then f = P+(ZW6) for a function 
w € H™. Furthermore, y may be chosen so that ||f||« = || Woo. 


Here and below, || - ||« is the dual space norm on BMOA induced by H!. 


Proof of Proposition 4.1 Let f € Kx, g € H', and suppose {g,} is a sequence of 
H?-functions with ||g, — g||; > 0. We have then 


Tz,f = P+ (8,P+@WO)) = P+ (8:28), 
where w is related to f as in Lemma 4.2. Now put 
® := Py (gz). 


This definition makes sense, since P+ is applied to an L'-function; besides, it does 
not depend on the choice of yw. (Indeed, if yw; and wW are both eligible in the sense 
of Lemma 4.2, then Ww, — W% € 0H.) And since P+ is a continuous mapping from 
L' to every H? with 0 < p < 1 (cf. [16, p. 128]), we conclude that ® € H? and 
|7z,f — P|» — 0 for any such p. Oo 


Now suppose X is a Banach space of analytic functions on the disk, with X C H!. 
We say that X is a K-space if, for each y € H%™, the Toeplitz operator Tj acts 
boundedly from X to itself, with norm at most const - || ||oo. (This is essentially 
equivalent to saying that X enjoys the so-called K-property of Havin. The latter was 
defined in [17] by the formally weaker condition that Tj (X) C X, for all y €¢ H™, 
but the norm estimate is usually automatic.) 

Following [17], we remark that X will be a K-space provided it is (isomorphic 
to) the dual of some Banach space Y, consisting of analytic functions on D and 
satisfying the conditions 


(a) H™ 1 Y is dense in Y, and 
(b) for each y € H®, the multiplication operator f + fw acts boundedly from Y 
to itself, with norm at most const - || lo. 


(It is understood that the pairing between X and Y is given by (f,g) := A fgdm, 
which is meaningful at least for f € H°° M Y and g € X.) The Toeplitz operator 
Tj, : X — X is then the adjoint of the multiplication map in (b), which justifies our 
claim. 
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As examples of K-spaces, we list the following: 


¢ HP? withl<p<o, 

* the Hardy—Sobolev spaces H?” := {f € H? : f™ © H?} with 1 < p < oo and 
n> 1, 

* BMOA, and more generally, BMOA” := {f € H! : f™ € BMOA} with n > 0, 

* the Dirichlet-type spaces D, := {f € H? : ~,., walf(n)|? < co} associated 
with nondecreasing sequences w = {w,} of positive numbers, 

* the analytic Besov spaces By , with s > 0, p = 1, q = 1, and in particular 

° the classical Lipschitz-Zygmund spaces A* := BS, ,, with 0 < @ < oo. 


We recall that By, , is defined as the set of those analytic f on D for which the 
function 


pe (laa |, (42) 


is in L4 over the interval (0, 1) with respect to the measure dr/(1—r); here n is some 
(any) fixed integer with n > s and f{”(¢) := f (re). 

For most of the spaces considered, the K-property has been established by means 
of a duality argument, as outlined above. We refer to [17], where this is done for A® 
and some special cases of Hardy—Sobolev and Besov spaces; to [25, 26] for general 
HP” and Bos classes, as well as for BMOA™; and finally to any of [18, 19, 22] in 
connection with D,, spaces. 

As further examples of K-spaces, we mention Ki (1 < p < ow) and Kyg. Indeed, 
for g € H™, one verifies the inclusion TAK xe Ki, by noting that Ki is the kernel 
of the Toeplitz operator Tj : H? — H?, which commutes with Tz. Then one deduces 
that 75(Kxo) C Kxo, recalling that Kxg = Ki M BMOA and BMOA is a K-space. 
And, of course, the two inclusions are accompanied by the natural norm estimates: 
the norm of 7; is in both cases O(||g||o0), just as it happens for the containing spaces 
HP (1 < p < oo) and BMOA. 

The main result of this section is as follows. 


Theorem 4.3 Let 0 be an inner function, g € H', and let X be a K-space. The 
following are equivalent. 


(i) Tj acts boundedly from Kyo to X. 
(ii) Tj acts boundedly from K;° to X. 
(iii) The function 


0(z) — 0(0 
ig oe (0) 


Satisfies Tzk € X. 


Moreover, the operator norms ||Tg||k,o—x and ||Tg\|xgex are comparable to each 
other and to ||Tgk\|x. 
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In most — perhaps all — cases of interest, condition (iii) above can be further 
rephrased by saying that T;0 € X. In fact, since k = T;0 and T;Tz = T3T:, the 
implication 


Tz9 €X => Tike xX 


holds whenever X is a K-space. The converse is true provided that 1 € X and zX C 
X; indeed, 


T,6 = const + zT@k. 


In particular, we certainly have Tgk € X <=> T;6 © X when X is one of our 


smoothness classes, such as H?”, Bia A® or BMOA”), let alone H? and BMOA. 


The theorem then states that the inclusion Tgf € X holds for all f € Ko if and only 
if it holds for f = 0. 

The next fact is obtained by applying Theorem 4.3 with g = 1, in which case T3 
reduces to the identity map. 


Corollary 4.4 Given an inner function 0 and a K-space X, one has 
Kug CX <= > KG CX => kex. (43) 


And since the latter condition, k € X, is implied by (and is usually equivalent to) 
saying that 0 € X, the nontrivial part of (43) amounts to the implication 


O6E€X => Kuo CX. (44) 
Proof of Theorem 4.3 The part (i) ==> (ii) is trivially true, as is the inequality 
ITellkoosx < WlTallx.o>x- 
The part (ii) => (iii), along with the estimate 
1 
ITellapeox = 5llTakllx 
is also obvious, since k € Kf° and ||klloo < 2. 
What remains to be proved is the implication (iii) ==> (i) and its quantitative 
version 
Il Tzll xox < const - || Tzk|lx. (45) 


To this end, we fix f € Kx» and then invoke Lemma 4.2 to find a function y € H™ 
such that 


f=T58, [fll = I lloo. 
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Using the fact that coanalytic Toeplitz operators commute (and moreover, 7z7;, = 
T.;, whenever a, b and ab are H'-functions such that the operators involved are all 
well-defined), we obtain 


Tf = T; T=, 0 = 1, TgT20 = Ty Tk. (46) 
Finally, we recall that X is a K-space to get 


IITefllx < [TG llxx ll Tekllx 
S const - || ¥ loo || Tekllx 


= const - |[f||+ || Zzkllx. 


which readily implies (45). oO 
Finally, we supplement Theorem 4.3 with the following result. 
Proposition 4.5 Let 0, g and k be as above. The operator Tz acts boundedly from 
Kxo to itself if and only if Tzk € BMOA. In this case we also have 
IZefllp S CollTekll«llfllp, = 1<p<oo, 


for all f € K,°, so that Tz extends to a bounded operator on Ki. 


This might be compared to the “T(1)-” and/or “T(b)-theorem” of David, Journé 
and Semmes (cf. [14, Chapter 5] or [30, Chapter VII]), results that provide bounded- 
ness criteria for certain singular integral operators on L’. Just as in those theorems, 
we only have to test the operator on a single function. We also remark that the 
assumption T;k € BMOA can be rewritten as T39 € BMOA, and a sufficient 
condition for this to happen is that 


sup{|g(z)| : z € Q(6,€)} < co 


for some ¢ € (0, 1), where Q(0, €) is the sublevel set defined by (6). A proof of this 
last assertion can be found in [8]. 


Proof of Proposition 4.5 The first statement, concerning the action of Tz on Kx, is 
obtained by applying Theorem 4.3 with X = BMOA (or X = Kya). 

Now suppose Tk € BMOA, and let | < p < oo. Given a function f € K;°, put 
w := 2f0(= f) and note that w € H°. We have then 


f = 20 = Py (v0) = Ty, 


and so (46) remains in force. Setting h := 73k and making use of the elementary 
identity 


PF = zP_(F), FeL', 
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we can rewrite the resulting equality from (46) as 


Tf = Th = ZAgW. 


In view of Nehari’s theorem (see, e.g., [20, Part B, Chapter 1]), the assumption that 
h, and hence zh, is in BMOA implies that the Hankel operator H-, acts boundedly 


from H? to HP, with norm not exceeding C,||h||+. Consequently, 
ITefllo = |Hav |, < Colle Wp = CollAllallfllp, — f © KB. 


Finally, since K5° is dense in K (indeed, K5° contains the family of reproducing 
kernels for K}). we conclude that 7; extends to a bounded operator on K®, with the 
same norm. The proof is complete. 


References 


1. PR. Ahern, D.N. Clark, On inner functions with H’-derivative. Michigan Math. J. 21, 115-127 
(1974) 

2. R.B. Burckel, An Introduction to Classical Complex Analysis, vol. | (Academic Press, New 
York, 1979) 

3. C. Carathéodory, Theory of Functions of a Complex Variable, vol. Il (Chelsea Publishing 
Company, New York, 1954) 

4. L. Carleson, A representation formula for the Dirichlet integral. Math. Z. 73, 190-196 (1960) 

5. W.S. Cohn, Radial limits and star invariant subspaces of bounded mean oscillation. Am. J. 
Math. 108, 719-749 (1986) 

6. P.L. Duren, B.W. Romberg, A.L. Shields, Linear functionals on H’? spaces with 0 < p < 1. J. 
Reine Angew. Math. 238, 32-60 (1969) 

7. K.M. Dyakonov, Smooth functions and coinvariant subspaces of the shift operator. Algebra i 
Analiz 4(5), 117-147 (1992); translation in St. Petersburg Math. J. 4, 933-959 (1993) 

8. K.M. Dyakonov, Division and multiplication by inner functions and embedding theorems for 
star-invariant subspaces. Am. J. Math. 115, 881-902 (1993) 

9. K.M. Dyakonov, Multiplication by Blaschke products and stability of ideals in Lipschitz 
algebras. Math. Scand. 73, 246-258 (1993) 

10. K.M. Dyakonov, Factorization of smooth analytic functions via Hilbert-Schmidt operators. 
Algebra i Analiz 8(4), 1-42 (1996); translation in St. Petersburg Math. J. 8, 543-569 (1997) 

11. K.M. Dyakonov, Equivalent norms on Lipschitz-type spaces of holomorphic functions. Acta 
Math. 178, 143-167 (1997) 

12. K.M. Dyakonov, Two theorems on star-invariant subspaces of BMOA. Indiana Univ. Math. J. 
56, 643-658 (2007) 

13. K.M. Dyakonov, Blaschke products and nonideal ideals in higher order Lipschitz algebras. 
Algebra i Analiz 21(6), 182—201 (2009); translation in St. Petersburg Math. J. 21, 979-993 
(2010) 

14. E.M. Dyn’kin, Methods of the theory of singular integrals: Littlewood-Paley theory and its 
applications, in Commutative Harmonic Analysis, IV, ed. by V.P. Khavin, N.K. Nikol’skii. 
Encyclopaedia of Mathematical Sciences, vol. 42, pp. 97-194 (Springer, Berlin, 1992) 

15. E.M. Dyn’kin, The pseudoanalytic extension. J. Anal. Math. 60, 45-70 (1993) 

16. J.B. Garnett, Bounded Analytic Functions, Revised |st edn. (Springer, New York, 2007) 


266 


17. 


18. 


19. 


20. 


21. 


22. 


23. 
24. 


25. 


26. 


21: 


28. 


29. 


30. 


K.M. Dyakonov 


V.P. Havin, The factorization of analytic functions that are smooth up to the boundary. Zap. 
Nauchn. Sem. Leningrad. Otdel. Mat. Inst. Steklov. (LOMI) 22, 202-205 (1971) 

V.E. Katsnel’son, Remark on canonical factorization in certain analytic function spaces. Zap. 
Nauchn. Sem. Leningrad. Otdel. Mat. Inst. Steklov. (LOMI) 30, 163-164 (1972); English 
transl. in J. Soviet Math. 4, 444-445 (1975) 

B.I. Korenblum, V.M. Faivyshevskii, A certain class of compression operators that are 
connected with the divisibility of analytic functions. Ukrain. Mat. Zh. 24, 692-695 (1972); 
English transl. in Ukrainian Math. J. 24, 559-561 (1973) 

N.K. Nikolski, Operators, Functions, and Systems: An Easy Reading, Volume 1: Hardy, Han- 
kel, and Toeplitz. Mathematical Surveys and Monographs, vol. 92 (American Mathematical 
Society, Providence, 2002) 

M. Pavlovié, On Dyakonov’s paper “Equivalent norms on Lipschitz-type spaces of holomor- 
phic functions”. Acta Math. 183, 141-143 (1999) 

M. Rabindranathan, Toeplitz operators and division by inner functions. Indiana Univ. Math. J. 
22, 523-529 (1972/1973) 

D. Sarason, Sub-Hardy Hilbert Spaces in the Unit Disk (Wiley, New York, 1994) 

FA. Shamoyan, Division by an inner function in certain spaces of functions that are analytic in 
the disc. Zap. Nauchn. Sem. Leningrad. Otdel. Mat. Inst. Steklov. (LOMI) 22, 206-208 (1971) 
FA. Shamoyan, A class of Toeplitz operators that are connected with the divisibility of analytic 
functions. Funktsional. Anal. i Prilozhen. 13(1), 83 (1979); English transl. in Funct. Anal. Appl. 
13, 70-71 (1979) 

FA. Shamoyan, Toeplitz operators and division by an inner function in some spaces of analytic 
functions. Akad. Nauk Armyan. SSR Dokl. 76(3), 109-113 (1983, Russian) 

N.A. Shirokov, Ideals and factorization in algebras of analytic functions that are smooth up to 
the boundary. Trudy Mat. Inst. Steklov. 130, 196-222 (1978) 

N.A. Shirokov, Division and multiplication by inner functions in spaces of analytic functions 
smooth up to the boundary, in Complex Analysis and Spectral Theory (Leningrad, 1979/1980). 
Lecture Notes in Mathematics, Vol. 864, pp. 413-439 (Springer, Berlin/New York, 1981) 
N.A. Shirokov, Analytic Functions Smooth Up to the Boundary. Lecture Notes in Mathematics, 
vol. 1312 (Springer, Berlin, 1988) 

E.M. Stein, Harmonic Analysis: Real-Variable Methods, Orthogonality, and Oscillatory 
Integrals (Princeton University Press, Princeton, 1993) 


Rational Inner Functions on a Square-Matrix 
Polyball 
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Abstract We establish the existence of a finite-dimensional unitary realization for 
every matrix-valued rational inner function from the Schur—Agler class on a unit 
square-matrix polyball. In the scalar-valued case, we characterize the denominators 
of these functions. We also show that a multiple of every polynomial with no zeros 
in the closed domain is such a denominator. One of our tools is the Koranyi—Vagi 
theorem generalizing Rudin’s description of rational inner functions to the case of 
bounded symmetric domains; we provide a short elementary proof of this theorem 
suitable in our setting. 
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Introduction 


In this paper, we study rational inner functions on the Cartesian product of square- 
matrix Cartan domains of type I, i-e., on a unit square-matrix polyball, 
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B = £, x} Sesser SC penx lx 


= \z = (Z),...,Z) € Cex eccn x CHK - |Z || <lr= Lvcish}. 


We can interpret points of B as block-diagonal matrices Z = Qa, Z” with ||Z|| < 
1. Then B is a special case of a domain Dp defined by means of a matrix polynomial 
P as the set of z = (z,...,2¢) € C® satisfying ||P(z)|| < 1 (see [2, 4, 11]); in 
this case, P = Z viewed as a polynomial in matrix entries at s bf = lyetsy Ga 
r= 1,...,k, andd = a £ In particular, 6 is a special case of a Cartesian 
product of (not necessarily square) matrix Cartan domains of type I (see [10, 14]). 
The distinguished (or Shilov) boundary of B consists of k-tuples of unitary matrices, 


dsB = \z Se ZO EO 2 CO OZ a1 r= tcc 


which can also be interpreted as a set of block-diagonal unitary matrices. Notice that 
the unit polydisk D¢ is a special case of a unit square-matrix polyball where k = d, 
and £, = 1forr=1,...,k. 

We consider matrix functions in variables zy . We denote the corresponding d- 
tuple of variables by z and, for a function F, we identify F(z) = F(Z). Ans x s 
matrix-valued function F is rational inner if each matrix entry Fog is a rational 


function in d variables z which is regular in 6, and F takes unitary matrix values 
at each of its regular points on the distinguished boundary 058. Notice that the zero 
variety of the least common multiple of the denominators of the rational functions 
Fp in their coprime fraction representation has an intersection with ds of relative 
Lebesgue measure zero, which can be proved using an argument analogous to that 
of [6, Lemma 6.3]; thus almost all points of ds are regular points of F. 

Define 


k 
Tz= \r = Br”: TO = ean r= Ves ccs, 


r=1 


ay) is ad—tuple of commuting operators on a Hilbert space and ||T|| < i}. 


For T € 7z, the Taylor joint spectrum [19] of T viewed as a multioperator 


lies in the domain 6, and for a matrix-valued function F analytic on B one can 
define F(T) by means of Taylor’s functional calculus [20]; see [2] and a further 
discussion in [4]. We say that an sx s matrix-valued function F analytic on B belongs 
to the Schur—Agler class S.Az(C*) associated with BG, or rather with its defining 
polynomial P = Z, if its associated Agler norm, 
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|Fll4z = sup ||F(7)|| 
TETz 


is at most 1. In the scalar case, s = 1, we simply write S.Az. In the case of the unit 
polydisk DD“, this class coincides with the classical Schur—Agler class S.A, studied in 
the seminal paper of Agler [1]. Notice that SAz(C’) is a subclass of the Schur class 
Sz(C*) of s x s matrix-valued contractive analytic functions on B. It follows from 
[21] and [3] that these classes do not coincide, i.e., the analog of von Neumann’s 
inequality fails when d > 2, unless B = D*. Moreover, rational inner functions, 
which obviously belong to the Schur class, do not necessarily belong to the Schur— 
Agler class: an example of a rational inner function on D? which is not Schur-Agler 
was given in [12, Example 5.1]. 

In section “Scalar-Valued Rational Inner Functions”, we give a characterization 
of scalar-valued rational inner functions on G in terms of their coprime fraction rep- 
resentation. In section “Matrix-Valued Rational Inner Functions from the Schur—A- 
gler Class”, we describe matrix-valued rational inner functions on 6 that belong 
to the associated Schur—Agler class as the functions that have a finite-dimensional 
unitary realization. In section “Eventual Agler Denominators”, we characterize 
eventual Agler denominators, i.e., those multivariable polynomials which can be 
represented as the denominators of scalar rational inner functions in the Schur— 
Agler class S.Az, in terms of certain contractive determinantal representations. We 
also show that a multiple of every polynomial with no zeros on the closed domain, 
B, is an eventual Agler denominator, and we end with several open questions. 


Scalar- Valued Rational Inner Functions 


For a polynomial p in d variables Ze, ij =1,...,0,,7r = 1,...,k, where d = 
pS £2, we define its reverse with respect to B as 


r=1 


k 
P(Z) =] [aetz)*p(Z, 


r=1 


where f, is the total degree of p in the variables ra i,j = 1,...,€,. We say that 
a polynomial p is B-stable (resp., strongly B-stable) if p does not have zeros in B 
(resp., in B). 


The following result is a generalization of Rudin’s characterization of rational 
inner functions on the polydisk [18] to the case of a square-matrix polyball B. It 
appears in more generality in [17, Theorem 3.3], where Rudin’s theorem is extended 
to all bounded symmetric domains. We provide a proof that applies to the specific 
setting of B, and therefore requires less machinery. 
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Theorem 2.1 A scalar-valued function f on B is rational inner if and only if there 


exist a B-stable polynomial p and nonnegative integers m,, ..., m, such that 
f@) = [][eerzy" 2 (a) 
a e acer 
p(Z) 


r=1 


One can choose p to be coprime with D. 
For the proof of Theorem 2.1, we will need the following proposition. 


Proposition 2.2 Let p be a B-stable polynomial, and suppose that \p(Z)| = 1 for 
all Z € 0sB. Then there exist nonnegative integers m,, ..., mg such that 


k 
p(Z) = | [(detzy. 


r=1 


Proof Notice that if Z” is unitary for each r, then 


k 
P(Z)pZ) = | [detz)". (2) 


r=1 


Since ds is a uniqueness set for analytic functions (see, e.g., [14]), we have that (2) 
holds for all Z = (Z,...,Z) © Co x... x C&*&, Since detZ is an 
irreducible polynomial in matrix entries ze (see, e.g., [7, Section 61]) we obtain 
that p(Z) = TIL, det zy" for some m, < t,,r = 1,...,k. Oo 


Proof of Theorem 2.1 The sufficiency of the representation (1) for f to be rational 
inner is clear. To prove the necessity, we first write f = g/p, with p and q coprime. 
Since f is analytic in 6, we have that p is B-stable. Next, for Z € ds5 we have that 
q(Z)q(Z—!*) = p(Z)p(Z—'*). Hence the equality 


k k 
| [@etZ)"q(Zq(Z-*) = | [(detZ)*p(Z)p(Z-"*) 
r=1 r=1 
holds for every Z € 0sB, where t, is the maximum of the total degrees in ge 
i,j =1,...,4€,, ing and p. Then it also holds for all Z. Since g and p are coprime, p 
is a divisor of Tie (det Z”)* q(Z—!*). Hence 


k 
w(Z)q(Z) = J [(der2)"pZ-™9), 


r=1 
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for some polynomial u. Observe that |u(Z)| = 1 on 056. Then by Proposition 2.2 
we obtain that u(Z) = Th, (det Z)“”, with some ju, < t,, and 


k 
q(Z) = | [(detz)"pZ"*). 
r=1 
The latter equality implies that m, := t, — 4, — t, = 0, where t, is the total degree 
of p in variables Za i,j =1,...,€,. It follows that 


k 
q(Z) = | [detZ)”" PZ), 


r=1 
and (1) holds. oO 


Remark 2.3 Proposition 2.2 and Theorem 2.1 also hold when 6 is replaced by a 
Cartesian product of square-matrix Cartan domains of type II [14], 


Bym = ae (Z,...,2Z%) € B:Z =Z0T p= Dysesl) 


or by a Cartesian product of mixed type involving Cartan domains of types I and 
II. The proofs are similar after noticing that det Z™ as a polynomial in a ij = 
1,...,0,: i <j, is also irreducible when Z” = Z'; see, e.g., [7, Section 61]. 


Matrix-Valued Rational Inner Functions 
from the Schur—Agler Class 


We now characterize matrix-valued rational inner functions on the unit square- 
matrix polyball 6, which belong to the associated Schur—Agler class, in terms of 
their unitary realizations. This is a generalization of the result [5, Theorem 2.1] for 
the unit polydisk ¢ which, in turn, is a matrix-valued extension of the result from 
[16] in the scalar-valued setting (see also an earlier paper [9] for the bidisk case). 


Theorem 3.1 Ans x s matrix-valued function F on B is rational inner and belongs 
to the class SAz(C*) if and only if F has a finite-dimensional unitary realization, 
i.e., there exist nonnegative integers ny, ..., Nx and a unitary matrix 


AB k k 
€ Cri ltr t$5)x (Lat lone ts) 
aA 


such that 


F(Z) = D+ CZ,(I — AZ,) |B, 
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where Z, = (aan (Z @ I,,). If we write F = QP, where P and Q are matrix 
polynomials of total degree at most g such that P* P = Q*Q on OsB, then n,..., nk 
can be chosen so that n; < Ls(@T<), r=1,...,k. 


For the proof of Theorem 3.1, we will need the following proposition. Recall that 
a linear mapping ®: C**“ — C>*° is said to be completely positive if, for every m € 
N, the mapping ®”); (C™«y"™™" — (C>*)y"™™ defined by (6 (A));, = (Ajj), 
i,j = 1,...,m, is positive, that is, it maps every positive semidefinite matrix A to a 
positive semidefinite matrix ® (A). 


Proposition 3.2 ([8, Theorem 1]) Let ®: C**“ — C°*° be a completely positive 
linear mapping. Then there exists Y € C>*> so that D(X) = Y*(X @ Ia). 


Proof of Theorem 3.1 The sufficiency part is analogous to that of [6, Theorem 6.1]. 
To prove the necessity, let F = QP~!, where P and Q are matrix polynomials of 
total degree at most g, P*P = Q*Q on 0;B, and assume that F € S.Az(C*). Then 
by [4, Theorem 1.5] there exist separable Hilbert spaces K, and analytic functions 
H, on B with values linear operators from C* to C’ ® K, such that 


k 
P(W)*P(Z)—Q(W)* Q(Z) = YAW)" (C-WZ) BI, )H-(Z), Z,WeEB. 


r=1 


Letting Z = W = tU where |t| < 1 and U € OsB, we obtain 


P(tU)* P(tU) — Q(tU)* Q(tU) 
1 — |t\? 


ik 
= ) 0 HF (QU)H,(tU). (3) 


r=1 


Since P(tU)* P(tU) = Q(tU)* Q(tU) for all U = (U,...,U™) € 5B and |t| = 
1, the numerator of the left-hand side of (3) is a polynomial in ¢ and 7 which vanishes 
on the variety 1—t# = 0. Therefore the left-hand side of (3) is a polynomial in ¢ and ¢ 
and a trigonometric polynomial in matrix entries a ctf = Length t= Waak 
Let P, and Q, be the coefficients of z” in the polynomials P and Q, respectively, 
and let H,.~ be the coefficient of z* in the Maclaurin series for H,, where for z = 
(z1,...,Za) and @ = (a,...,@q) we set 2* = z{'--- 20". Then the zeroth Fourier 
coefficient of the left-hand side of (3) as a trigonometric polynomial in variables 
aye is 
1 * _a* 2|a| 
Te » (Py Pa — yOu) |t|"', 


lal<g 


where |a| = a, +---+ aq. Note that the preceding expression is a polynomial in 
|t|? of degree at most g — 1. The zeroth Fourier coefficient of the right-hand side 
of (3) (for sufficiently small f), 


k 


Se ell 


r=1 @ 


is therefore a polynomial in |¢|? as well, and Ht Hy. = 0 for |a| = g. 
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Consider now the completely positive map ®,: C°*% — CT xs) 
defined via 


(Xx) = COl|o|<e—1 (Hj )(X ® Tk, LOW |a|<e—1 (Aa). 


Then by Proposition 3.2 we can find matrices Y, € C#>*> such that ®,(X) = 
Y*(X @ Iw)Y,, where a = €, and b = ci Gea fe Writing Y,, = roWjq}<e-1(Yia), We 
can form a polynomial 


G,(Z) = > Yr" 


la|<g-1 
with the coefficients in C’"""**, where n, = Lae), so that 
H,(Ww)* (u — WZ) @ Ix, )H-Z) = G,(W)* (a — WO*Z) @ In, )G@), r=1,...,k, 


and 
k 
P(W)*P(Z) — OW)*O(Z) = Y>G-(W)* (= WZ) @ by, )Ge(Z). (A) 
r=1 


Rearranging the terms in (4), we obtain 


k k 
PIW)*P(Z) + YG WY" (WZ & In, GZ) = QW)" OZ) + Y° G(W)* GZ). 


r=1 r=1 
Therefore 


(Z @ In)Gi(Z) G,(Z) 


: hw : 
(Z @ In,)Gx(Z) G,(Z) 
P(Z) Q(Z) 
is a linear and isometric map from the span of the elements on the left to the span 
of the elements on the right. It may be extended (if necessary) to a unitary matrix 
AB 
so that 
CD 
AZ,G(Z) + BP(Z) = G(Z) 
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for every Z; here Z,, = a, (Z @ I,,). Solving the first equation above for G(Z) 
and then plugging the result into the second equation yield 


F(Z) = Q(Z)P"'(Z) = D+ CZ,(I — AZ,)'B. 
oO 


Remark 3.3 An analog of Theorem 3.1, with a similar proof, is also valid for 
Cartesian products of Cartan domains of type II or III [14] or for Cartesian products 
of mixed type involving Cartan domains of types I, II, and III. We recall here that a 
Cartan domain of type II (resp., III) is a (lower-dimensional) subset of a square- 
matrix Cartan domain of type I consisting of symmetric (resp., antisymmetric) 
matrices. 


Eventual Agler Denominators 


(") 
ij? 
reversive with respect to the square-matrix polyball B if v= yv, for some scalar 
y with |y| = 1. 

We have the following generalization of a result that was announced in [13] for 
the case of a unit polydisk. 


We will say that a polynomial v in z;’, i,j = 1,...,£,r=1,...,k, is almost self- 


Theorem 4.1 Let p be a B-stable polynomial which is coprime with D. Then the 
following are equivalent: 


(i) p is an eventual Agler denominator, that is, there exist nonnegative integers 
S},...,5% such that the rational inner function TK, det Z)* D (Z)/p(Z) is 
in the Schur—Agler class S Az. 

(ii) There exists an almost self-reversive polynomial v of multidegree (s,,..., 5x) 
such that p(Z)v(Z) = det — KZ,,) for some nonnegative integers n,, r = 
1,...,, and a contractive matrix K, where Z, = a, (Z @® I,,,)- 


Proof (i)=> (i) Let f(Z) = Te, (det Z)* H (Z)/p(Z) be in S.Az. By Theorem 3.1 
there exists a k-tuple n = (n,...,nx) of nonnegative integers and a unitary matrix 
[ 4 8 | such that 


f(Z) =D+ CZ, (I — AZ,)~'B. 


Using the factorization 


I—AZ, B| _ I O|[1—AZ, 0 |[I U—AZ,)'B 
=C%, BD) |=CZ,(%=AzZ)7 7 0 f(Z)} {0 I ; 
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we observe that 


' — AZ,, | 
det 

(2) = —CZ, D 
fA) = det(I — AZ,) 


and 


I-AZ,B)_, [AB ro). [2 07)... e.. 
act | _cz, | = det E A aet(| ‘| if *)) = Adet(A Zn), 


where A = det| 4 3 |. Hence 


k 
| [Get 2)” p Z) det — AZ,) = Ap(Z) det(A* — Z,,). (5) 


r=1 


Since p is B-stable and coprime with ), the polynomials p and The, (det Z)* 
do not have common factors. Therefore p divides det(/ — AZ,,), i.e., there exists a 
polynomial v so that 


p(Z)v(Z) = det — AZ,,). (6) 
Taking the polynomial reverse on both sides, we obtain 
det Z,p(Z*—!)u(Z*-!) = det(Z, — A*). 


Using (5) and (6), we obtain 


k k 
[ [et 2) F @p@u(z) = aT] (der Zz yer pz) F Zoro, 


r=1 r=1 


where deg,p is the total degree of p in the variables re . After dividing out we see 
that v is almost self-reversive and s, = n, — deg, p — deg, v. Clearly, K = Aisa 
contractive matrix. 

(ii)= (i) Suppose there exists an almost self-reversive polynomial v such that 
p(Z)v(Z) = det( — KZ,) with a contractive matrix K and a k-tuple n = 
(n,...,Nx) of nonnegative integers. Then by a straightforward modification of [12, 
Theorem 5.2] the rational inner function 


Tk Get 2)" p(Z*Dv(Z) 
p(Z)v(Z) 


276 A. Grinshpan et al. 


is Schur—Agler. Since v is almost self-reversive, the rational inner function 


ie (det 2)" #8, p(Z*-1) 
p(Z) 


is Schur—Agler, i.e., p is an eventual Agler denominator. oO 


Remark 4.2. An analog of Theorem 4.1 is valid for a Cartesian product of Cartan 
domains of type II, i.e., for a domain Bgym, or for a Cartesian product of mixed type 
involving Cartan domains of types I and II; see Remarks 2.3 and 3.3. 


The following result is a consequence of [11, Theorem 4.1] formulated for the 
case of the square-matrix polyball B. 


Theorem 4.3 Some strongly B-stable multiple of every strongly B-stable polyno- 
mial is an eventual Agler denominator. 


Proof By [11, Theorem 4.1] there exist a strongly B-stable polynomial q, a 
k-tuple of integers n = (n,...,m), and a strictly contractive matrix K € 
CU fr)<(Cr=1 &”) such that p(Z)q(Z) = det(I—KZn), where Z, = BL (Z@ 
I,,,.). Then, similarly to the last paragraph in the proof of Theorem 4.1 (with v = 1), 
one shows that pq is an eventual Agler denominator. 


Corollary 4.4 Letf be a rational inner function on B which is regular on 0sB. Then 
there exists a rational inner function g which is regular on 0sB such that fg € S Az. 


Proof By Theorem 2.1 there exists a stable polynomial p which is coprime with 
D and such that (1) holds with some nonnegative integers m,...,m,. Since f is 
regular on 056, the polynomial p is strongly B-stable. By Theorem 4.3, pq is an 
eventual Agler denominator for some strongly B-stable polynomial g. Therefore 
TIL, detZ) "£7 /q € SAz for some nonnegative integers 1,..., sg. o 


The question as to whether the assumption of regularity of f on ds8 in 
Corollary 4.4 can be removed is open. Another open question is whether Corol- 
lary 4.4 holds for matrix-valued rational inner functions. Finally, it is interesting 
to investigate the analogues of the results in this paper for the unbounded version 
of the domain B, i.e., the Cartesian product of matrix half-planes. The Cayley 
transform over the matrix variables Z™, r = 1,...,k, would allow one to obtain a 
finite-dimensional realization formula for rational inner functions on the product of 
matrix half-planes; if, in addition, the Cayley transform over the values of a function 
is applied, then one can obtain the corresponding realization formula for rational 
Cayley inner functions over the product of matrix half-planes (see [5] for the case 
of a poly-half-plane, i.e., the product of scalar half-planes). We would also like to 
mention [15] where a subclass of Cayley inner functions on the product of matrix 
half-planes, the Bessmertnyi class, was studied. 
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A Note on Local Holder Continuity of Weighted 
Tauberian Functions 


Paul Hagelstein and Ioannis Parissis 


Abstract Let M and Msg respectively denote the Hardy-Littlewood maximal 
operator with respect to cubes and the strong maximal operator on R”, and let 
w be a nonnegative locally integrable function on R”. We define the associated 
Tauberian functions CHL (a) and Cg (a) on (0, 1) by 


Cuiw(@) = sup aint € R": Mye(x) > a) 


EcR" W(E 
0<w(E)<oo 
and 
Cy (a) = sup a(t € Re Ms xe(x) > @}). 
ecr" W(E) 
0<w(E)<oo 


Utilizing weighted Solyanik estimates for M and Ms, we show that the function 

Cu lies in the local Holder class Cll) (0, 1) and Cg,, lies in the local 
; =f 

Holder class Cc"? 

dimension n. 


(0, 1), where the constant c, > 1 depends only on the 
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Introduction 


This note concerns how Solyanik estimates may be used to establish local Hélder 
continuity estimates for the Tauberian functions associated to the Hardy-Littlewood 
and strong maximal operators in the context of Muckenhoupt weights. In [4], 
Hagelstein and Parissis used Solyanik estimates to prove that the Tauberian func- 
tions Cy_(a) and Cg(a) associated to the Hardy-Littlewood and strong maximal 
operators in R” both lie in the local Hélder class C!/"(1, 00). The techniques of that 
paper are surprisingly robust, and we here will show how the weighted Solyanik 
estimates for the Hardy-Littlewood and strong maximal operators obtained in [5, 6] 
may be used to establish local Hélder smoothness estimates for the Tauberian 
functions of the Hardy-Littlewood and strong maximal operators in the weighted 
scenario. 

We now briefly review what Solyanik estimates are and how they may be 
used to establish local smoothness estimates for Tauberian functions associated 
to geometric maximal operators in the setting of Lebesgue measure. Let B be a 
collection of sets of positive measure in IR”, and define the associated geometric 
maximal operator Mg by 


Wis a= [ IF. 


x€REB |R| 


For 0 < aw < 1, the associated Tauberian function Cg(q) is given by 


1 
Cg(a):= sup —|{x eR”: Mexe(X) > @}I. 
ecr" |E| 


0<|E|<oo 


Our ordinary expectation is that, provided 6 is a basis with reasonable differ- 
entiation properties, for 0 < a < 1 anda very close to 1, we should have 
{x € R" : Mgze(x) > a@}| is very close to |EZ| itself, and accordingly that 
Cx3(@) is very close to 1. Solyanik estimates provide a quantitative validation of 
this expectation. In particular, we have the following theorem due to Solyanik [9]; 
see also [3]. 


Theorem 1.1 (Solyanik [9]) We have the following Solyanik estimates for the 
Hardy-Littlewood and the strong maximal operator: 


(a) Let M denote the uncentered Hardy-Littlewood maximal operator on R" with 
respect to cubes, and define the associated Tauberian function Cy. (a) by 


1 
Cur(a) = sup —|{x € R": Myz(Q) > @}JI. 
ecr" |E| 


0<|E|<oo 
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Then for a € (0, 1) sufficiently close to 1 we have 
CuL(@) — 1 Sn (la). 


(b) Let Mg denote the strong maximal operator on R", and define the associated 
Tauberian function Cg(a) by 


1 
Cs(a) = sup —|{x € R”: Msyz(x) > @}I. 
ecrr |E| 


0<|E|<o0 
Then for a € (0, 1) sufficiently close to 1 we have 
Cs(e) — 1S, Ua)". 
The following theorem associated to the embedding of so-called halo sets enables 


us to relate Solyanik estimates to Hélder smoothness estimates. 


Theorem 1.2 (Hagelstein and Parissis [4]) Let B be a homothecy invariant 
collection of rectangular parallelepipeds in IR". Givena set E C R" of finite measure 
and 0 < a < 1, define the associated halo set Hy(E) by 


Hpa(E) = {x ER": Mpye(x) > a}. 
Then for all a,5 € (0,1) witha < 1—6, we have 


HealE) C Hp aito-ot+s) (He 1—s(E)). 


An immediate corollary of this theorem is the following. 


Corollary 1.3 (Hagelstein and Parissis [4]) Let 6 be a homothecy invariant 
collection of rectangular parallelepipeds in R" and let a,5 € (0,1). Then for 
a <1—6 we have 


Cu(a) < Ca(a(1 + 2-"*8))Cg( — 8). 


Now, we of course have that Cg(q) is nonincreasing on (0,1). If 6 is the 
collection of rectangular parallelepipeds in R” whose sides are parallel to the axes 
(so that Mg = Ms), we can accordingly combine the above corollary with the 
Solyanik estimates for Ms provided by Theorem 1.1 to relatively easily obtain the 
following. 


Corollary 1.4 (Hagelstein and Parissis [4]) Let CyH_(@) and Cs(a) respectively 
denote the Tauberian functions associated to the Hardy-Littlewood maximal opera- 
tor with respect to cubes and the strong maximal operator in IR" with respect to a. 
Then 


Cut € C'/"(0,1) and Cg € C'/"(0, 1). 
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The purpose of this note is to establish weighted analogues of Corollary 1.4. 
To make this precise let us consider a non-negative, locally integrable function 
w on R". The relevant Tauberian functions Cy_,,(@) and Cg,,(a) are defined on 
(0, 1) by 


Cuiw(@) = sup a cagihy € R”: Myze(x) > a}) 
EcR" W(E) 


0<w(E)<oo 


and 


Cew(@) = sup —w({x eR": Mg xe) > a). 
Ww 


0<w(E)<oo 


It was shown in [7] that the condition CuL,,(@) < +00 for some a € (0, 1) already 
implies that M : L?(w) > L?(w) for some 1 < p < o and, similarly if Cg (a) < 
+oo for some a € (0,1) then Mg : L?(w) > L?(w) for some 1 < p < ov. These 
results pose an important restriction on the kind of functions w we can consider 
in proving Hdlder regularity estimates for CyL,, and Cg,y. In particular, it is well 
known that the class of functions w such that M : L?(w) > L?(w) for some p € 
(1, co) is the Muchkenhoupt class of weights Ago; see for example [2]. Here we use 
the Fujii-Wilson definition of the Muckenhoupt class Ago. Namely, the weight w 
belongs to the class Ago if and only if 
[wla., ‘= su : 

= 2 w(Q) 


[ Movzo) < +00. 
Q 


where the supremum is taken with respect to all cubes in R” whose sides are parallel 
to the axes. This description of the class Ago goes back to Fujii [8], and Wilson, 
[10, 11]; see also [1]. Thus w € Ag is a necessary condition for the continuity of 
Cut.w on (0, 1). It turns out that w € Ag is also a sufficient condition for the Hélder 
regularity of Cui. 


Theorem 1.5 Let w € Ago be a Muckenhoupt weight on IR". Then 
Cr wie Comey (0,1), 


where the constant c, depends only on the dimension n. 


Moving to the multiparameter case, the condition that Mg : L?(w) > L?(w) 
for some p € (1,00) is equivalent to the condition w € A%,, where A3, denotes 
the class of multiparameter or strong Muckenhoupt weights. A few words about 
how the multiparameter Muckenhoupt class A%, is defined are in order here. For 
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x = (x,...,%) € R" and 1 < j < n we may associate the point # := 
(X1,.- 6X1, Aft + ++ Xn) € R’—!. Associated to a non-negative locally integrable 
function w on R” and x’ is the one-dimensional weight 


w(t) = W(X1,. +. Xft. EL Xj 1y +++ Xn), teR. 
Then [w],*, is defined by 


[Wlax, <= sup ess sup[w3]a..- 
l<j<n xeER"—! 


Here [v],4,, denotes the standard Fujii-Wilson A, constant of a weight v on R!, 
given by 


1 
(aos = SUP a, Mi(vyi). 


where the supremum is taken over all intervals J C IR and M, denotes the 
Hardy-Littlewood maximal operator on R!. Thus a weight w is a multiparameter 
Muckenhoupt weight if and only if [w]4x, < +00. We refer the reader to [6] and the 
references therein for more details on the definition and properties of multiparameter 
Muckenhoupt weights. 

With the definition of multiparameter Muckenhoupt weights in hand, the previ- 
ous discussion shows that a necessary condition for the continuity of Cg ,, on (0, 1) 
is that w € A%,. As in the one parameter case, we show that w € A®, is also sufficient 
for the Hélder continuity of Cg,, on (0, 1). 


Theorem 1.6 Let w € A%, be a multiparameter Muckenhoupt weight on R". Then 


Cs y = Callas (0, 1), 


where the constant c,, depends only on the dimension n. 


Notation 


We use the letters C,c to denote positive numerical constants whose value might 
change even in the same line of text. We express the dependence of a constant C on 
some parameter n by writing C,. We write A < Bif A < CB for some numerical 
constant C > 0. If A < C,B we then write A <,, B. In this note, w will always denote 
a non-negative, locally integrable function on R”. Finally, we say that a function f 
lies in the Hdlder class C’ (J) for some interval J C R if for every compact set K C J 
we have [f(x) —f(y)| Sx |x — yl? for all x,y € K. In this case we will say that f is 


locally Hélder continuous with exponent p in J. 
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Weighed Solyanik Estimates and Hélder Regularity 


In this section we show that the strategy for establishing Hédlder smoothness 
estimates for Cy_ (a) and Cg(a@) may be adapted to the weighted context. To imple- 
ment the above strategy, we need Solyanik estimates that provide us quantitative 
information as to how close Cy_(a@) and Cg(a) are to 1 for a near 1. Of course, 
the related estimates are expected to depend on w. Suitable Solyanik estimates in 
this regard were found in [5, 6] when w is a Muckenhoupt weight. In particular, we 
have the following: 


Theorem 3.1 (Hagelstein and Parissis [5, 6]) Let w € Ago. We have the Solyanik 


estimate 


Curwla) — 1S, Ad — 1) nblAco)! whenever 1>a@>1—e %l¥lco, 


wn 


Here A,, is the doubling constant of w, and c, and the implied constant depend only 
upon the dimension n. 


A multiparameter analogue of Theorem 3.1 the following. 


Theorem 3.2 (Hagelstein and Parissis [6]) Let w be a non-negative, locally 
integrable function in R". If w € AX, we have 


-1 ae re 
Cy y(@) —1 <p Aa) eo forall 1>a>1—e ads, 


~N 


where c > 0 is a numerical constant. 


With these weighted Solyanik estimates at our disposal we can now give the proof 
of the Hélder continuity estimates for Cy, and Cg,,,. 


Proof of Theorem 1.5 Let K be a compact subset in (0, 1) and let mx,Mx € (0, 1) 
be such that mx < x < M, for all x € K. Since w € Ag there exists some q € (0, 1) 
such that M : L?(w) > L&°°(w) and thus supyex Cutw(@) Swnx 1. Furthermore, 
by Theorem 3.1 we have that 


Cut.w(@) =I Swan (1 = ox) (all400)~™ forall 1>a>1- e7CnllAce =a. (1) 


We first consider x, y € K with0O < y—x < min( eek, sar m«) =: 7. We can 
then write 


Cuw() — CHL wy) = Cut) — CHL (x(1 gon 2 —s )). 


gntly 


Now observe that by our choice of x, y we have 


= tMe. 4 
grtiYOX < antl gS t= Me Sak 
x Qnt+l mK 
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We can thus apply Theorem 1.2 with x in the role of w := x and 6 := 2”! = to get 
Hp x(E) C He y(Hea—s)(E)) 


for all measurable E where here G6 denotes the collection of all cubes in R” whose 
sides are parallel to the axes. This immediately implies 


Cr) < CHrwO)CHLw (1 — 2°41), 


xX 


Thus we can estimate 


CH w(x) — CH wy) < Coty) [Cri.s(1 - ate) aa 1 


y-x 
Swank Cuiw(1 _ grt a =] 
since SUPyex CHL.w(@) Swn« 1. Noting that 
—* l-a 
ae = oe arr, ME > Wa; 


an appeal to (1) gives 


Y =X) (calwlAco)7! a 
Cutw(x) = Cutw(y) Swank (—): brlAco) SK (y = x) (nb ldoo) . 
We have shown that 


ICut.w(y) _ Cutw(x)| < 


~wwn,K 1. 


xyeK ly - | Callao) 


On the other hand, if x,y € K with y—x => n then the Hdlder estimate follows 
trivially since sup, yex ICHtw(x) — Cut)! Sw.n.x 1 So we are done. oO 


The proof of Theorem 1.6 is virtually identical to that of Theorem 1.5. 

One may naturally wonder how sharp the above smoothness estimates are for 
CuLw(a@) and Cg,,(a). In particular we may ask the questions: Are CyL,,(a@) and 
Cg.(a) differentiable on (0, 1)? Are they in fact smooth on (0, 1)? To the best of 
our knowledge, even the question of whether or not the sharp Tauberian constant 
Cui(a) of the Hardy-Littlewood maximal operator on R in the Lebesgue setting is 
differentiable constitutes an unsolved problem. All of these topics remain a subject 
of continuing research. 
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Three Observations on Commutators 
of Singular Integral Operators with BMO 
Functions 


Carlos Pérez and Israel P. Rivera-Rios 


Abstract Three observations on commutators of Singular Integral Operators with 
BMO functions are exposed, namely 


1. The already known subgaussian local decay for the commutator, namely 


le 2 : ITF xo)601 > M’PCAV}] < ce VI 
is sharp, since it cannot be better than subgaussian. 

2. It is not possible to obtain a pointwise control of the commutator by a finite sum 
of sparse operators defined by L log L averages. 

3. Motivated by the conjugation method for commutators, it is shown the failure of 
the following endpoint estimate, if w € A, \ A; then 


ae) 
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Introduction 


The purpose of this paper is to present some observations concerning commutators 
of singular integral operators with BMO functions. These operators were introduced 
by Coifman, Rochberg and Weiss in [6] as a tool to extend the classical factorization 
theorem for Hardy spaces in the unit circle to R”. These operators are defined by the 
expression 


Tatts) = | (b(s) — OKO) dy. ) 


where K is a kernel satisfying the standard Calder6n-Zygmund estimates and where 
b, the “symbol” of the operator, is a locally integrable function. Of course, these are 
special cases of the more general commutators given by the expression 


T, = [b, T| =M,0T—ToM, 


where T is any operator and M,j is the multiplication operator M)f = b-/f. 

The classical well known result from [6] establishes that [b, 7] is a bounded 
operator on L?(IR”), 1 < p < ov, when the symbol b is a BMO function. We state 
this result. 


Theorem 1.1 Let T be a singular integral operator and b a BMO function. The 
commutator T; is bounded on L?(R") for every 1 < p < ~@. 


In the same paper it is shown that b € BMO is also a necessary condition namely, 
if the commutators [b, Ril, j = 1,---,n of b with the Riesz transforms R; are 
bounded on L?(R") for some p € (1, 00) and every € {1,2,...,n}thenb € BMO. 

None of the different proofs of this result follows the usual scheme of the 
classical Calder6n-Zygmund theory for proving the L?(IR”) boundedness of singular 
integral operators T. Two proofs of Theorem 1.1 can be found in [6]. The first 
and main one in that paper is based on methods involving techniques similar to 
those used in [5] to understand the Calderén commutator. As far as we know this 
approach has not been so influential. However, the second proof, based on the so 
called conjugation method from operator theory, has been widely used. In fact, it 
is quite surprising that this proof was postponed to the end of the paper since it 
turns out to be highly interesting. Indeed, the method shows the intimate connection 
between these commutators and the A, theory of weights. Furthermore, this proof 
can be applied to general linear operators, not only for Singular Integral Operators. 
As a sample we will point out the following particular L? case: 


Theorem 1.2 Suppose that T is a linear operator such that 


T:L?(w) > L’(w) 
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for every w € Ap. Then for every b € BMO, 
[b, T] : L?(R") — L?(R"). 


The method of proof can be carried out in more generality as shown in [1]. The 
key initial argument of the proof is that we can write [b, T] as a complex integral 
operator using the Cauchy integral theorem as follows 


1 Tf) 


0 2m Sims 2 


d : 
[b, Tf = —e’T (fe) 
dz 


where 


22> Tf) = er( 5) ze, 
e 


This is called the “conjugation” of T by e” and the terminology comes most 
probably from group theory. Now, if || - || is a norm we can apply Minkowski 
inequality: 


1 
If. TIF = —— sup ITO e > 0. 


ldl=e 


The effectiveness of the method can be checked in the modern context of weighted 
L? estimates. Indeed, the method produces very optimal bounds of the operator norm 
as shown in [4] (see also [15]). 

This method reveals the role played by the following operation: 


fac eawr (“) 
Ww 


where w is a weight which, in this context, is an A, weight. Indeed, this is the case 
by the well known key property of the BMO class, if p > 1 and b € BMO then 
there is a small ¢9, such that e” € Ap, for any real number ¢ such that |¢| < e9. These 
operators were already studied by B. Muckenhoupt and R. Wheeden in the 1970s 
and by E. Sawyer in the 1980s. Some of the problems they left open were solved 
in [8]. A consequence of the main result of [8] is that if w € A; then 7, is of weak 
type (1, 1), namely 


I Ty llr (R") 11.9 (R") < © 


with bound depending upon the A, constant of w. However, we will exhibit examples 
of weights w € A, \ A; in section “Third Observation: The Failure of an Endpoint 
Estimate Motivated by the Conjugation Method” for which T,, is not of weak type 
(1, 1), namely 


Il Tw llc amy. ¢R") =o. 
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This shows that the case w € A, is specially relevant. Perhaps, this phenomenon can 
be explained by the fact that the conjugation method is closely attached to commu- 
tators with BMO functions which are not of weak type (1, 1) as observed in [25]. 
Indeed, the conjugation method works due to the property, already mentioned, that 
if p > 1 and b € BMO thene” € A, for small values of t. However, this property 
turns out to be false in the case p = 1. The lack of the weak type (1, 1) property for 
commutators is replaced by a Llog L inequality like (5) below and not better. 

There is another proof of Theorem 1.1 based on the use of the sharp maximal 
function of C. Fefferman and E. Stein which has also been very influential. It seems 
that it was first discovered by J. O. Str6mberg as mentioned by S. Janson in [17] (see 
also [28] pp. 417-419) The proof relies on combining the following key pointwise 
estimate 


M*([b, TIf) < cllbllamo (M, (Tf) + M, (f)) (2) 


where | < r,s < oo and M,(f) = M(|f|")'/” together with the classical Fefferman- 
Stein inequality: 


IM(P)Iv < clM*(P)Ile. 


Here we use standard notation, M is the Hardy-Littlewood maximal function and M# 
is the sharp maximal function. The L? boundedness of M and T yields the alternative 
proof of Theorem 1.1. Proceeding in the same way we obtain the corresponding 
estimates for A, weights. 

This approach was considered by S. Bloom in [2] extending in an interesting way 
Theorem 1.1 but only on the real line. 


Theorem 1.3 Let 1,A € Ap, and let H be the Hilbert transform: 


[>, H] : L?(u) — PPA) 


where v = yr Amp if and only if 
vy) = sup —— - Co. 
BMO(v) ra 10) Jo QO 


The power of the pointwise estimate (2) is reflected in many situations, for 
instance in [12], where similar results were derived for commutators of strongly 
singular integral with symbol in the new BMO class (3) (see also [13, 14] for an 
alternative approach based on dyadic shifts). 

However, estimate (2) is not sharp enough for many purposes and much better 
results can be obtained with the following variation: 


Mj ([b, TIf) < cllbllamo (M. (Tf) + M7 (f)) 0<8<e<1 (4) 
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where M? stands for Mo M (see [25]). Here, the key difference is that we are consid- 
ering small parameters 6 and ¢. The estimate is sharp since M? cannot be replaced 
by the (pointwise) smaller operator M. Indeed, otherwise these commutators would 
be of weak type (1, 1) but, as we mentioned above, this is not the case [25] where it 
is shown that commutators satisfy the following “ZL log L” type estimate, 


w({x ER" : |[b, TIF@)| > Ay») se fo (Toten) wdx 2>0, (5) 


where w € A;, ®(t) = tlog(e+7) and where c > O depends upon the A, 
constant. This shows that these commutators are “more singular” than Calder6n- 
Zygmund operators. The original proof of (5) follows from the key pointwise (4) 
combined with a good-A type argument, but an alternative proof was obtained by 
the first author and G. Pradolini in [26] with the bonus that non Ago weights can be 
considered. This argument is based on a variation of the classical scheme used to 
prove the weak type (1, 1) for Calderén-Zygmund operators. The statement of the 
result is the following. 


Theorem 1.4 Let T be a Calder6én-Zygmund operator and b € BMO. If w is an 
arbitrary weight the following inequality holds 


Mito 2 tow a) dx 


w(fe ER": |[b, TH) > A}) < Cor A . (Ito /S™) 


for every € > 0. 


Very recently (cf. [27]) the authors have obtained a quantitative version of 
the endpoint estimate for arbitrary weights, namely Theorem 1.4. This result is 
analogous to the one obtained by the first author and T. Hyténen for singular 
integrals in [15]. 


Theorem 1.5 Let T be a Calderén-Zygmund operator and b € BMO. If w = O isa 
weight then, for every ¢ > 0 


Cc 
w(fe eR" : [DTV > <5 i ® (ie M, (og 1)1+= WEE 
R"” 


The main novelty here is the appearance of the sharp factor 4 reflecting again 


the higher singularity of the operator. As a corollary of this result we can derive the 
following result obtained previously by C. Ortiz-Caraballo in [23], 


n “a 
w({x eR" : |[b, TIf@)| > AY) < C® (wlay? f ® (Uliano wdx. 
R" 

We remark that it seems that the conjugation method cannot be applied to prove 
this estimate. Therefore, estimate (5) or Theorem 1.4 works, so far, for Calderén- 
Zygmund operators not for general linear operators assuming a minimal appropriate 
weighted weak type estimate. 
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Another interesting difference between Calder6n-Zygmund operators and com- 
mutators concerns their local behavior. A very nice way of expressing this is by 
means of the following estimate due to Karagulyan [18]: there exists a constant 
c > 0 such that for each cube Q and for each function f supported on the cube Q 


ltr Q:|Tf)| > mMp@} <ce tO. (6) 


This result can be seen as an improvement of Buckley’s exponential decay theorem 
[3] which is a very useful result. For instance, it allows to improve in a quantitative 
way the classical good-A inequality between T and M: if p € (0,00) and w € Ago 


ZF loa < cer P Wha IMM low: 


Motivated by this result of Karagulyan, Ortiz-Caraballo, Rela and the first author 
developed a new method for proving (6) in [24]. This method is flexible enough 
to deal with other operators including the commutators. In particular, we have the 
following sub-gaussian estimate. 


Theorem 1.6 Let T be a Calderén-Zygmund operator and b € BMO, then there 
exists a constant c > 0 such that for each f 


ix € O: |[b, TIF x0) | > MP FOO}| <ceTVeMPllamo — tO. (7) 


SUP 5 


a 


We will show in section “First Observation: Sharpness of the Subexponential 
Local Decay” that this subexponential decay is fully sharp. In section “Second 
Observation: A “natural” but False Sparse Domination Result for Commutators”, 
we will provide a new proof of (6) based on the pointwise domination: if T is 
a Calderén- eee operator, then it is possible to find a finite set of y-sparse 
families {Sr (see section “Second Observation: A “natural” but False Sparse 
Domination Result for Commutators” for the definitions) contained in the same or 
in different dyadic lattices D; and depending on f such that 


3n 


IZf(x)| < er ) As f(x) (8) 
j=l 
where 
Asf(x) = If| xo(x). 
oe dX aH, on 


See section “Second Observation: A “natural” but False Sparse Domination Result 
for Commutators” for details, in particular Theorem 3.6. 
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In view of the interest of an estimate like (8) it would be relevant to produce a 
counterpart for commutators. The “natural” sparse operator for these commutators 
would be 


Bsf@) = > Wligogto XO: 
QES 


The reason that leads to consider this sparse operator in terms of the average 
ll - || Llog LO is due to the intimate relationship of commutators and M? which is 
an operator pointwise equivalent to M igars In section “Second Observation: A 
“natural” but False Sparse Domination Result for Commutators” we prove the 
impossibility of having a domination theorem for commutators by these “sparse” 
operators. 


First Observation: Sharpness of the Subexponential 
Local Decay 


We prove in this section that Theorem 1.6 is sharp, i.e., we can find a Calder6n- 
Zygmund operator T, a symbol b € BMO a function f and a cube Q such that 


ail EQ: |[b, TIf()| > M7F()}| = ce Ve Mllawo 


for some constant c > 0. More precisely we have the following. 


Observation 1 Let b(x) = log |x|, then we can find a constant c > 0 such that 
K€ 0.1) : [A Hon@)l > B= OV" 


where H stands for the Hilbert transform. 


Proof Let f(x) = x(0,1)(x). We are going to show that 
{x € 0,1) : |b. AY@)| > MFO} = Le € 0,1): IBA) > B1>ce7V" > 0. 


For x € (0, 1) we have that 


1y =i ! log(;) '/* Jog(4 
og(x) — log(y) — / Apis / g(+) ae 
c= 0 *-y o 1-t 


arcs) = | 


Now we observe that 


~ Dope "fe Leg y =e 
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1 
and since = is positive for (0, 1) U (1, co) we have for 0 < x < | that 
Jog) 
o.oo > fo Ear 
, _ 


Finally, a computation shows that 


'/x Jog (4 1\? 
i BO) ax (ioe -) x20. 
1 1l-—t x 


Consequently, we have that for some x9 < 1 


1\2 
\[b, HIf(x)| > ¢ (108 -) O0<x < Xx. 
x 
and then for some fo > 0, 


{x € (0,1): |[b, HIfG)| > o| = =e vile t>t (9) 


1\2 
{re (0,x9) tc (108 = > | 


as we wanted to prove. oO 


Second Observation: A “natural” but False Sparse 
Domination Result for Commutators 


Before stating the result we are going to prove in this section we need some notation. 
We borrow it from [21]. 


Definition 3.1 (Dyadic child) Let Q be a cube (with sides parallel to the axis). 
We call dyadic child any of the 2” cubes obtained by partitioning Q by n “median 
hyperplanes” (planes parallel to the faces of Q and dividing each edge into 2 equal 
parts). 


If we iterate the partition process of the preceding definition we obtain a standard 
dyadic grid D(Q) of subcubes of Q which has the usual properties: 


1. Foreachk = 0,1,2,... cubes in the k-th generation have sidelength 2-* and tile 
Q in a regular way. 

2. Each Q’ in the k-th generation has 2” children in the in the (k + 1)-th generation 
contained in it and one and only one parent in the (k—1)-th generation containing 
it (unless it is Q itself). 

3. If Q',O” € D(Q), then O'N CO” = Bor’ COE” orQ" cq. 

4. If QO’ € D(Q), then D(Q’) C D(Q). 
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Definition 3.2 (Dyadic lattice) A dyadic lattice D in R” is any collection of cubes 
such that 


(DL-1) If Qe€D then each dyadic child of Q is in D as well. 
(DL-2) If Q’,Q” € D there exists Q € D such that QO’, O” € D(Q). 
(DL-3) If K is acompact set of R” there exists Q € D such that K C Q. 


There is an easy way to build a dyadic lattice by considering a increasing 
sequence of dyadic cubes Q; such that U®,Q; = R". Then 


D=|)0@Q) 


j=l 


is a dyadic lattice. 


Definition 3.3 Let 7 € (0,1). We say that a family of cubes S C D is n-sparse if 
for each Q € S we can find a measurable subset E(Q) C Q such that: 


1. E(Q)’s are pairwise disjoint. 


2. n|Q| < |E(Q)| 


Definition 3.4 Let A > 1. We say a family of cubes S is A-Carleson if for every 
cube Q € D we have 


d> [PIS AIO. 


PES,PCO 


There is an interesting relation between Carleson and sparse families that we 
summarize in the following lemma that we borrow from [21] 


Lemma 3.5 /f S is a A-Carleson family of cubes then it is +-sparse. Conversely if 


S is a n-sparse family of cubes then it is a ;-Carleson family of cubes. 
Armed with all these definitions we can state the following pointwise domination 
theorem. 


Theorem 3.6 Let T be a Calderén-Zygmund operator. There is a finite set of 


_, contained in the same or in different dyadic lattices Dj 


n-sparse families {5}; 
and depending on f such that 


3n 


T*f(x) < crn Y As f(x) (10) 
j=l 
where Asf(x) = Does; ral fo fl xo(x). 


The proof of this result can be found in [21] and [7]. In [19] M. Lacey obtains 
the same estimate for Calderén-Zygmund operators that satisfy a Dini condition. 
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Recently a fully quantitative version of Lacey’s result was obtained in [16] and even 
more recently this quantitative version has been simplified in [20]. 

As a sample of the interest of this result we give a different proof of the 
exponential estimate (6): there exists a constant c > 0 such that for each cube Q 
and for each f supported on the cube Q 


giltreo: T*f(x) > tMf(x)}| < ce" t> 0, (11) 


To prove this result we will use the classical vector-valued extension of the 
maximal function introduced by Fefferman and Stein in [11] that can be written 
as follows: 


Mf) = (LMpeo)*) "= MFOI|q. 


j=l 


where f = {fj I is a vector—valued function. 

Taking into account that T* is controlled by a finite sum of sparse operators it 
suffices to establish (11) for those operators. Assume that suppf © Q for a certain 
cube. It is clear that we can find c, pairwise disjoint cubes Q; € D which union 
covers Q and such that |Q;| ~ |Q| We can assume those cubes to belong to any 
sparse family S C D, since it’s easy to check, taking into account Lemma 3.5, that 
adding a finite number of pairwise disjoints cubes to a sparse family the resulting 
family is again a sparse family. First we are going to prove that if S is and Q; € S 
with |Q;| ~ |Q|, as we have just showed that we can assume, then 


: xEQ: S) xp(x)>tp| <ce™. (12) 


|O| PES, PCO; 


We begin observing that 


xeQ: >> xpQ@)>t 


[Q| PES, PSO; 


ae xEONQ;: > XrP(x) >t 


Jal PES, PCQ) 


x€Q;): SY) xp(x) >t] = Co. 


1 
€ 
IQ\| PES, PCO; 
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We use now one of the key estimates from [24]. Indeed, let {E(P)}pes, pcg; be the 
family of sets from Definition 3.3. We have then for some c > 0 that 


q 
Yvw => (a 1) yo) 


PES QES 


q 
<c DO (jee) 10) 


PES, PSO; 


1 q 
c », (a [ remoya) XP(x) 


PES, PSO; 


c (™, ({xe@}res,eco,) (y)" 


. (M,g)(x))* , 


IA 


IA 


where g; = {XE®)} pes, Po, is supported in Q;. Now, since {E(Q)}pes, pcg; is a 
pairwise disjoint family of subsets, we have that for any j 
I/q 
lgi@le =| So (xz@@)*] <1. (13) 


PES, PCO; 


We finish the proof of (12) recalling that if |gj|e € L°, then (M,g;(x))" € ExpL 
(see [11]) from which we conclude that: 


Co < ce, t>0. 


Now we go back to the proof of the estimate. We first observe that 


1 
a {x € QO: Asf(x) > Mf (x)}| 


Cn 


1 
< erties 
<Lig 


{+ €0:As (xa) () > Lmcrro)co} | 


Hence it suffices to obtain an estimate for each term of the sum. First we may assume 
that |Q;  Q| # 0 since otherwise, ff Xo; = 0 for every measurable set and the 
corresponding term in the sum equals zero. Now we split As(f79,) as follows 


1 1 
Acro) = Ly Frote®)+ Ya : Froxe). (14) 


PES, PCO; PES, P20; 
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We observe that for the first term 


Pes, PCO; al Sof xojxe) < > XP(x) 
_— P . 
M (fx0,) (x) PES, PCO) 


For the second term, we have that that since 0; Q € @ and |Q| ~ |Q\| then 
OQ C 5Q;. Consequently, 


1 
Des, P29; TH pf x0; xP) 7 di Pes, P20; IPI Jo,f xP) 


M(fx0;)() = 1 
a 15Q)| i ot 
|Q)| 
< Ch > Vaal (x) 
PES, PDQ; 
| 
= Ch y nk’ 
k=0 
Then, combining the estimates obtained for each of the terms of (14), we have that 
1 t 
ral xe O07 Ag (Fx0;) (x) > ~ MOfxo) 0) 
Pr. aor. oe 3s 1 
— (x : x ——-C, — 
“jal aaa rk 


PES, PSO; tb k=0 


and the desired conclusion, namely (11), follows from (12). 


Observation 2 Let T be a Calderén-Zygmund operator and b € BMO. It is 
not possible to find a finite set of n-sparse families {Sr with N dimensional, 
contained in the same or in different dyadic lattices D; and depending on f such 
that 


N 
[b.TV@| < cord Bsf@) ae.xeR" (15) 
j=l 
where Bsf (x) = Does; All toe z.0 Xo(%). 


We are going to give two proofs of this result. The first one is based on the Rubio 
de Francia algorithm. 


Proof 1 Suppose that (15) holds, then we can prove the following L! inequality 


>. TH llctoy < cba, IMF lle) (16) 
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Indeed, 


I| [b. TH \ln'qy) S Cb,T 3 Bs flew 


j=l 


ses Ueno roel? 


J=1 QES; 


‘ 


um a3 y Vigo or \E(0)| 


J=1 QES; 


ig 


“yy f Mog tt F(x) Mw(x)dx 


J=1 QES; 


Ch. 
= rr MF llaiwys 


since M2 x M 


: a : 
Diogt* We claim now the L’ version, 


Ib. Ty lle) < cnpllM*fllvey p> 1. (17) 
Indeed, by duality we can find g > Oin L”’ (R") with unit norm such that 
If. Tiflvcn = flo. THCdIecoae. 
R” 


We consider the Rubio de Francia algorithm 


ee] k 
M"(g) 
Re= ) —_ 
2 4 ME, R") 


It’s a straightforward computation that R(g) is an A; weight with constant 
[Rela = 2||M|\,," < Cnp 
and also that g < Rg and |[Rgl|)>" < 2|[8 lp’ qn = 2. Then have that 
[io.rreleear s [ 1b. TcoIRewas 
Re" Re 

and using (16) and Hélder inequality 

[ le. nyeiReeyar = clRely, [ MFCORECa 

R" R" 
sop [ M°Fo)ReC)ax = cplMFlrce IRelly cn 
R” 


< cp\|M*f lve: 
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Hence (17) is established. Now since 
2 
||M? llvcany < en (P') p> 
we have that 
” 
I[b. Tle) < cp (p’) p>tl (18) 


Now let us observe that if we take [b, H]f with b(x) = log |x| and f(x) = xo) 
then 


IIIb. Ail = ep? p> 1, 


and this leads to a contradiction when p — oo. To prove this lower estimate we use 
estimate (9) from Theorem 1. Indeed, for some fp > 0 


1 
I>. Alf lle = | [b. Af llc = mp Ae ER: |[b, AIf@)| > Bl? 
t 


ce OF 
> supt|{x € (0, x9) : c(loe -) >t 
t>to Xx 
xvi 
=> suptce ? = cp’ toe ¥® 
t>to 
and this concludes the first proof. Oo 


For the second proof we will rely on the sharpness result that was settled in the 
previous section. 


Proof 2 Assume again that (15) holds. Then, for some c > 1 


{x €Q:|[b, TY C)| > iM? (x)}| < 


N 
EQ: be py If llr og r,pXPO) > suena} | 


j=l PES; 


It will be enough for our purposes to work on each term of the inner sum, namely to 
control 


XE OD) Uf ll goes pXP@) > MF) 


PES; 
Now, recalling that M*f ~ M «jJ> 18 not hard to see that essentially the same 


argument we used to prove (11) yields that 


1 aly 
ial xEQ: > All op, pXPO) > tM f(x) p| <ce™. 
Q PES; 
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Combining the preceding estimates we arrive to 


a |{x € Q: \[b, TIf(x)| > tM’ (x)}| < <ce™ t>0 


which is a contradiction by Observation 1. oO 
The correct pointwise control for the commutator seems to be the following one 


Conjecture 1 Let T be a Calderén-Zygmund operator and b € BMO. Then 


N 
|[>. TF) < CM, T)[Ibllamo D> As, (Asf) @) 


ij=l 
where As f(x) = oes; a fo If) | dyxo(x) and the sparse families S; are not 
necessarily subfamilies of the same dyadic lattice. 


If this conjecture holds it would be very easy to recover the main theorem from 
[4] since it suffices to iterate the following estimate: 


1 
nex 


Dw) = < Cap [wl,, 


|Asf fll 


which was studied in [9, 10] (see also [21]). 


Third Observation: The Failure of an Endpoint Estimate 
Motivated by the Conjugation Method 


In this section we consider the following family of operators: 


f > Ty) := wT (4) (19) 


where w is a weight and T is a Calder6én-Zygmund operator. We already mentioned 
in the introduction that these operators are of interest since they are very much 
related to commutators due to the conjugation method. We emphasized that the case 
w € A, is special since 7, is of weak type (1,1) as a consequence of the main 
results from [8]. Understanding the case w € A, would be more interesting due to 
its connection with the conjugation method. However, 7, is not of weak type (1, 1) 
in general since there are weights w € A, \ A; for which 


I Twlleaany oR") =H, 
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being the purpose of this section to show the existence of such weights. In fact we 
are going to show something worst replacing T by the less singular operator M. 


Observation 3 Let 1 < p < 00, then there is w € A, \ A, such that 
[Mule aaryszicocgny = 0. 


Proof In dimension 1 we choose the A, weight w(x) = |x|~°"-) with 6 € 
(0. min i. I) and f = 10,1) so that, f € L'(w). We prove that 


lao 


Indeed, a computation shows that for x > 1 


X 
m (222) @ > = 2 a 


with 8 = 1 + 6(1 — p) and then 


1 
1 1 1\8 
jw (£) > — supr|{x>1: x Ot stl = sup t (; ) —l]=00 
W J |IL1.0c0(R) B iso B 15150 t 


since B € (0,1). Oo 


= ow. 
L1.c°(R) 


An interesting question is to find a necessary and sufficient condition for the 
boundedness of this operator, namely, characterize the weights w for which 


|, 1 @n—L.0(R") < Ow. 


In [22] Muckenhoupt and Wheeden proved that this inequality holds for w € A, in 
the real line and also obtained a necessary condition on the weights, namely 


< cw(x) ae.x € R"” 
Loo (R") 


| wo 
| -—x|" 


but we don’t know whether is sufficient or not. 
To end this section we show that can go further and prove a negative result for 
possible L log L type estimates. 


Observation 4 Let 1 < p < ov, and let P(t) = t log(e + t)*, a > 0. Then we can 
find w € Ap \ A, and f such that there’s no c > 0 for which 


eer": wa (4) > <e/ 0 (a (20) 
WwW n t 
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Proof As above we do it for the case n = 1. We assume the contrary, namely there 
is a finite constant c > 0 such that (20) holds for any nonnegative f. Let f = 7,1). 
For this choice of f the right hand side of (20) equals ® (4) and we have that 


1 ; X01) 


Choose again the A, weight w(x) = |x|") with & € (0. min {1, 41). 


Proceeding and using the same notation as in the proof of Observation 3 we have 
that 


1 
1 1 1\4 
sup —— |freR : wm (A22) > tl > c sup ——— (;) -1 
1>0 (1) w 0<t<1 o(1) t 
1 
t 1\28 
=c sup ; -|(+) i| =o. 
0<t<1 log (e+ 1) 


since B € (0, 1). Oo 
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A Two Weight Fractional Singular Integral 
Theorem with Side Conditions, Energy 
and k-Energy Dispersed 


Eric T. Sawyer, Chun-Yen Shen, and Ignacio Uriarte-Tuero 


This paper is dedicated to the memory of Cora Sadosky and her 
contributions to the theory of weighted inequalities and her 
promotion of women in mathematics. 


Abstract This paper is a sequel to our paper Sawyer et al. (Revista Mat Iberoam 
32(1):79-174, 2016). Let o and w be locally finite positive Borel measures on 
R” (possibly having common point masses), and let T% be a standard a-fractional 
Calderén-Zygmund operator on R” with 0 < a <n. Suppose that Q : R” > R’ is 
a globally biLipschitz map, and refer to the images QQ of cubes Q as quasicubes. 
Furthermore, assume as side conditions the A} conditions, punctured AJ conditions, 
and certain a-energy conditions taken over quasicubes. Then we show that T® is 
bounded from L? (a) to L? (w) if the quasicube testing conditions hold for T* and 
its dual, and if the quasiweak boundedness property holds for T®. 

Conversely, if T* is bounded from L? (co) to L?(w), then the quasitesting 
conditions hold, and the quasiweak boundedness condition holds. If the vector of 
a-fractional Riesz transforms R* (or more generally a strongly elliptic vector of 
transforms) is bounded from L? (a) to L? (w), then both the AS conditions and the 
punctured A5 conditions hold. 
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Our quasienergy conditions are not in general necessary for elliptic operators, 
but are known to hold for certain situations in which one of the measures is one- 
dimensional (Lacey et al., Two weight inequalities for the Cauchy transform from R 
to C, arXiv:1310.4820v4; Sawyer et al., The two weight T1 theorem for fractional 
Riesz transforms when one measure is supported on a curve, arXiv:1505.07822v4), 
and for certain side conditions placed on the measures such as doubling and k- 
energy dispersed, which when k = n—1 is similar to the condition of uniformly full 
dimension in Lacey and Wick (Two weight inequalities for the Cauchy transform 
from R to C_, arXiv:1310.4820v1, versions 2 and 3). 


Introduction 


The boundedness of the Hilbert transform Hf (x) = Te ay on the real line R in 


the Hilbert space L? (R) has been known for at least a century (perhaps dating back 
to A & E'): 


ZF llezm Sifleze, fe LR). (1) 


This inequality has been the subject of much generalization, to which we now turn. 


A Brief History of the T1 Theorem 


The celebrated T1 theorem of David and Journé [3] extends (1) to more general 
kernels by characterizing those singular integral operators T on R” that are bounded 
on L?(R"), and does so in terms of a weak boundedness property, and the 
membership of the two functions T1 and 7*1 in the space of bounded mean 
oscillation, 


| 
= 


ITUlsmoceny 5 Ill]. ¢eny = 


| 
= 


IT" Ulamowey S Wleco@) = 


These latter conditions are actually the following testing conditions in disguise, 


IMolle@y S Melle@y = vial 


IT*Lolla@y S ellie) = viel. 


'Peter Jones used A&E to stand for Adam and Eve. 
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tested uniformly over all indicators of cubes Q in R” for both T and its dual 
operator T*. This theorem was the culmination of decades of investigation into the 
nature of cancellation conditions required for boundedness of singular integrals.” 

A parallel thread of investigation had begun even earlier with the equally 
celebrated theorem of Hunt, Muckenhoupt and Wheeden [4] that extended (1) 
to measures more general than Lebesgue’s by characterizing boundedness of the 
Hilbert transform on weighted spaces L* (R;w). This thread culminated in the 
theorem of Coifman and Fefferman* [2] that characterizes those nonnegative 
weights w on R” for which all of the ‘nicest’ of the L* (R") bounded singular 
integrals T above are bounded on weighted spaces L? (IR"; w), and does so in terms 
of the A, condition of Muckenhoupt, 


1 1 1 
— x) dx — dx) <1, 
(5 fw Veal ae ") < 


taken uniformly over all cubes Q in R”. This condition is also a testing condition in 
disguise, in particular it follows from 


1 1 
W/ IIL (Rew) W I r2(R"3w) 
ee . . a5 a £(Q) . n 
tested over all ‘indicators with tails’ sg (x) = @+h0| of cubes Q in R”. 


A natural synthesis of these two threads leads to the ‘two weight’ question 
of characterizing those pairs of weights (o,w) having the property that nice 
singular integrals are bounded from L? (R";o) to L? (R";@). Returning to the 
simplest (nontrivial) singular integral of all, namely the Hilbert transform Hf (x) = 
[= (dy on the real line, Cotlar and Sadosky gave a beautiful function theoretic 
characterization of the weight pairs (o,) for which H is bounded from L? (IR; 0) 
to L? (R;@), namely a two-weight extension of the Helson-Szegé theorem. This 
characterization illuminated a deep connection between two quite different function 
theoretic conditions, but failed to shed much light on when either of them held. 
On the other hand, the two weight inequality for positive fractional integrals, 
Poisson integrals and maximal functions were characterized using testing conditions 
by one of us in [24] (see also [6]) and [23], but relying in a very strong way on 
the positivity of the kernel, something the Hilbert kernel lacks. In light of these 
considerations, Nazarov, Treil and Volberg formulated the two weight question for 
the Hilbert transform [35], that in turn led to the following NTV conjecture: 


>See e.g. chapter VII of Stein [34] and the references given there for a historical background. 
3See e.g. chapter V of [34] and the references given there for the long history of this investigation. 


308 E.T. Sawyer et al. 


Conjecture 1 [35] The Hilbert transform is bounded from L?(R";0) to 
L? (R"; @), ie. 


IZ Fol zae) S Wllaw)» fel (Re), (2) 


if and only if the two weight Az condition with two tails holds, 


(aft) (iq feo) 52 


uniformly over all cubes Q, and the two testing conditions hold, 


7100 |e cer0) S WMollzcer6) = VIle- 
"1 90llp2@er:5) S Molli2er;0) = VIQlo 


uniformly over all cubes Q. 


In a groundbreaking series of papers including [19, 20] and [21], Nazarov, Treil 
and Volberg used weighted Haar decompositions with random grids, introduced 
their ‘pivotal’ condition, and proved the above conjecture under the side assumption 
that the pivotal condition held. Subsequently, in joint work of two of us, Sawyer 
and Uriarte-Tuero, with Lacey [16], it was shown that the pivotal condition was not 
necessary in general, a necessary ‘energy’ condition was introduced as a substitute, 
and a hybrid merging of these two conditions was shown to be sufficient for use 
as a side condition. Eventually, these three authors with Shen established the NTV 
conjecture in a two part paper; Lacey, Sawyer, Shen and Uriarte-Tuero [13] and 
Lacey [8]. A key ingredient in the proof was an ‘energy reversal’ phenomenon 
enabled by the Hilbert transform kernel equality 


having the remarkable property that the denominator on the right hand side remains 
positive for all y outside the smallest interval containing both x and x’. This proof of 
the NTV conjecture was given in the special case that the weights o and w had no 
point masses in common, largely to avoid what were then thought to be technical 
issues. However, these issues turned out to be considerably more interesting, and 
this final assumption of no common point masses was removed shortly after by 
Hyténen [6], who also simplified some aspects of the proof. 

At this juncture, attention naturally turned to the analogous two weight inequal- 
ities for higher dimensional singular integrals, as well as a-fractional singular 
integrals such as the Cauchy transform in the plane. In a long paper begun in [28] 
on the arXiv in 2013, and subsequently appearing in [30], the authors introduced the 
appropriate notions of Poisson kernel to deal with the AJ condition on the one hand, 
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and the w-energy condition on the other hand (unlike for the Hilbert transform, these 
two Poisson kernels differ in general). The main result of that paper established the 
T1 theorem for ‘elliptic’ vectors of singular integrals under the side assumption 
that an energy condition and its dual held, thus identifying the culprit in higher 
dimensions as the energy conditions. A general T1 conjecture is this (see below for 
definitions). 


Conjecture 2 Let T*” denote an elliptic vector of standard a-fractional singular 
integrals in R". Then T®" is bounded from L? (R"; 0) to L? (IR"; @), i.e. 


T°" (Fo) Il r2@@”:09) > IL ll 22 cens0) » Je L (R"; 0), (3) 
if and only if the two one-tailed A conditions with holes hold, the punctured A 
conditions hold, and the two testing conditions hold, 


T°" 100 || 2¢@ne) = ILollz2cgnsc) = y|Q|,. 
wn,dual any 
T°" Me 1g | ea geeenes s Lollz2@ene) J | oe 


for all cubes Q in R" (whose sides need not be parallel to the coordinate axes). 


In [32], the authors have recently shown that the energy conditions are not 
necessary for boundedness of elliptic vectors of singular integrals in general, but 
have left open the following conjecture, which in view of the aforementioned main 
result in [30], would yield the 71 theorem for gradient elliptic operators. An elliptic 
a-fractional singular integral vector T*” in R” is said to be gradient elliptic if both 
|VeK® (x,y)] 2 pe yl"! and [Vy Ke (x,9)| B be — yl 


Conjecture 3 Let T*” denote a gradient elliptic vector of standard a-fractional 
singular integrals in R". If T*" is bounded from L? (IR"; a) to L? (R"; @), then the 
energy conditions hold as defined in Definition 2.6 below. 


While the energy conditions are not necessary for elliptic operators in general 
[32], there are some cases in which they have been proved to hold. Of course, they 
hold for the Hilbert transform on the line [16], and in recent joint work with M. 
Lacey and B. Wick, the five of us have established that the energy conditions hold 
for the Cauchy transform in the plane in the special case where one of the measures 
is supported on either a straight line or a circle, thus proving the 71 theorem in this 
case. The key to this result was an extension of the energy reversal phenomenon 
for the Hilbert transform to the setting of the Cauchy transform, and here the 
one-dimensional nature of the line and circle played a critical role. In particular, 
a special decomposition of a 2-dimensional measure into ‘end’ and ‘side’ pieces 
played a crucial role, and was in fact discovered independently in both [26] and 
[17]. A further instance of energy reversal occurs in our 71 theorem [31] when one 
measure is compactly supported on a C! curve in R’. 

The paper [18, v3] by Lacey and Wick overlaps both our paper [30] and this 
paper to some extent, and we refer the reader to [30] for a more detailed discussion. 
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Finally, we mention an entirely different approach to investigating the two weight 
problem that has attracted even more attention than the T1 approach we just 
described. Nazarov has shown that the two-tailed A> condition of Muckenhoupt 
(see below) is insufficient for (3), and this begs the question of strengthening the 
Muckenhoupt condition enough to make it sufficient for (3). The great advantage 
of this approach is that strengthened Muckenhoupt conditions are generally ‘easy’ 
to check as compared to the highly unstable testing conditions. The disadvantage 
of course is that such conditions have never been shown to characterize (3). The 
literature devoted to these issues, beginning with that of Pérez [22], and continuing 
more recently with work of many groups involving, among others, D. Cruz-Uribe, 
M. Lacey, A. K. Lerner, J. M. Martell, F. Nazarov, C. Pérez, A. Reznikov and A. 
Volberg, is both too vast and too tangential to this paper to record here, and we 
encourage the reader to search the web for more on ‘bumped-up’ Muckenhoupt 
conditions.* 

This paper is concerned with the T1 approach and is a sequel to our first paper 
[30]. We prove here a two weight inequality for standard a-fractional Calderén- 
Zygmund operators T* in Euclidean space R”, where we assume n-dimensional 
A conditions (with holes), punctured AS?" conditions, and certain w-energy 
conditions as side conditions on the weights (in higher dimensions the Poisson 
kernels used in these two conditions differ). The two main differences in this 
theorem here are that we state and prove® our theorem in the more general setting 
of quasicubes (as in [28]), and more notably, we now permit the weights, or 
measures, to have common point masses, something not permitted in [30] (and 
only obtained for a partial range of a in [18, version 3]). As a consequence, we 
use 5 conditions with holes as in the one-dimensional setting of Hytonen [6], 
together with punctured AS?"" conditions, as the usual A% ‘without punctures’ fails 
whenever the measures have a common point mass. The extension to permitting 
common point masses uses the two weight Poisson inequality in [24] to derive 
functional energy, together with a delicate adaptation of arguments in [28]. The 
key point here is the use of the (typically necessary) ‘punctured’ Muckenhoupt 
AS" conditions below. They turn out to be crucial in estimating the backward 
Poisson testing condition later in the paper. We remark that Hyt6nen’s bilinear 
dyadic Poisson operator and shifted dyadic grids [6] in dimension n = 1 can be 
extended to derive functional energy in higher dimensions, but at a significant cost 
of increased complexity. See the previous version of this paper on the arXiv for 
this approach,°® and also [18] where Lacey and Wick use this approach. Finally, 
we point out that our use of punctured Muckenhoupt conditions provides a simpler 
alternative to Hytoénen’s method of extending to common point masses the NTV 


4Starting e.g. with the recent articles [1] and [10]. 
Very detailed proofs of all of the results here can be found on the arXiv [29]. 


6 Additional small arguments are needed to complete the shifted dyadic proof given there, but we 
omit them in favour of the simpler approach here resting on punctured Muckenhoupt conditions 
instead of holes. The authors can be contacted regarding completion of the shifted dyadic proof. 
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conjecture for the Hilbert transform [6]. The Muckenhoupt A> conditions (with 
holes) are also typically necessary for the norm inequality, but the proofs require 
extensive modification when quasicubes and common point masses are included. 

On the other hand, the extension to quasicubes in the setting of no common point 
masses turns out to be, after checking all the details, mostly a cosmetic modification 
of the proof in [30], as demonstrated in [28]. The use of quasicubes is however 
crucial in our T71 theorem when one of the measures is compactly supported on a 
C! curve [31], and this accounts for their inclusion here. 

We also introduce a new side condition on a measure, that we call k-energy 
dispersed, which captures the notion that a measure is not supported too near a 
k-dimensional plane at any scale. When 0 < @ < n is appropriately related to k, 
we are able to obtain the necessity of the energy conditions for k-energy dispersed 
measures, and hence a T1 theorem for strongly elliptic operators T*. The case 
k = n—1 is similar to the condition of uniformly full dimension introduced in 
[18, versions 2 and 3]. 

We begin by recalling the notion of quasicube used in [28] - a special case of the 
classical notion used in quasiconformal theory. 


Definition 1.1 We say that a homeomorphism Q : R” — R” is a globally 
biLipschitz map if 


Q —Q 
Joly = ap LO-LOM 
x,yER" \|x — yl] 


oo, (4) 


and 2 <O. 


-1 
la 
Note that a globally biLipschitz map Q is differentiable almost everywhere, and 

that there are constants c, C > 0 such that 


c < Jag (x) = |detDQ (x)| < C, x eR’. 


Example 1.2 Quasicubes can be wildly shaped, as illustrated by the standard 
example of a logarithmic spiral in the plane f,(z) = z la = zeleln&) | Indeed, 
fe : C > C is a globally biLipschitz map with Lipschitz constant 1 + Ce since 
fe (w) = whol and 


a é a é éi . eis & ei 
Vfe = (F. =) = (ic? + ie |z|?" , ie= |z|? . 
dz OZ z 
On the other hand, f; behaves wildly at the origin since the image of the closed unit 

interval on the real line under f; is an infinite logarithmic spiral. 


Notation 1 We define P” to be the collection of half open, half closed cubes in 
IR" with sides parallel to the coordinate axes. A half open, half closed cube Q 
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in R" has the form Q = Q(c,£) = I] [ck - Lock + ) for some £ > 0 and 
k=1 

Cc = (C},..-,€n) € R". The cube Q(c,£) is described as having center c and 

sidelength €. 


We repeat the natural quasi definitions from [28]. 
Definition 1.3. Suppose that Q : R” — R” is a globally biLipschitz map. 


1. If E is a measurable subset of IR”, we define QE = {Q (x) : x € E} to be the 
image of FE under the homeomorphism (2. 


(a) In the special case that E = Q is a cube in R", we will refer to QQ as a 
quasicube (or (2-quasicube if Q is not clear from the context). 

(b) We define the center cog = c (QQ) of the quasicube QQ to be the point Qceg 
where cg = c (Q) is the center of Q. 

(c) We define the side length £ (QQ) of the quasicube QQ to be the sidelength 
£ (Q) of the cube Q. 

(d) For r > 0 we define the ‘dilation’ rQQ of a quasicube QQ to be QrQ where 
rQ is the usual ‘dilation’ of a cube in R” that is concentric with Q and having 
side length ré (Q). 


2. If K is a collection of cubes in R", we define QK = {QQ: Q € K} to be the 
collection of quasicubes QQ as Q ranges over K. 

3. If Fis a grid of cubes in R”, we define the inherited quasigrid structure on QF 
by declaring that QQ is a child of QQ’ in QF if Q is a child of Q’ in the grid F. 


Note that if QQ is a quasicube, then |QO|" ~ |Q|" = £(Q) = (QQ). 
For a quasicube J = QQ, we will generally use the expression |/ | in the 
various estimates arising in the proofs below, but will often use £ (J) when defining 
collections of quasicubes. Moreover, there are constants Rpjg and Rymay Such that we 
have the comparability containments 


Q + Qx9 Cc RpigQQ and Rsmat2Q Cc Q a Qxo. 


Given a fixed globally biLipschitz map @ on R”, we will define below the 
n-dimensional A conditions (with holes), punctured Muckenhoupt conditions 
ASP" testing conditions, and energy conditions using Q-quasicubes in place of 
cubes, and we will refer to these new conditions as quasiA5, quasitesting and 
quasienergy conditions. We will then prove a T1 theorem with quasitesting and 
with quasiA; and quasienergy side conditions on the weights. Since quasiA; N 
quasiASP'™" = ASM ASP" (see [31]), we usually drop the prefix quasi from the 
various Muckenhoupt conditions (warning: quasiA; # Aj). 

Since the A> and punctured Muckenhoupt conditions typically hold, this identi- 
fies the culprit in higher dimensions as the pair of quasienergy conditions. We point 
out that these quasienergy conditions are implied by higher dimensional analogues 
of essentially all the other side conditions used previously in two weight theory, in 
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particular doubling conditions and the Energy Hypothesis (1.16) in [16], as well as 
the condition of k-energy dispersed measures that is introduced below. This leads to 
our second theorem, which establishes the 71 theorem for strongly elliptic operators 
T* when both measures are k-energy dispered with k and aw appropriately related. 

It turns out that in higher dimensions, there are two natural “Poisson integrals’ 
P* and ?* that arise, the usual Poisson integral P” that emerges in connection 
with energy considerations, and a different Poisson integral P* that emerges in 
connection with size considerations. The standard Poisson integral P® appears in 
the energy conditions, and the reproducing Poisson integral P* appears in the A} 
condition. These two kernels coincide in dimension n = | for the caseea = 0 
corresponding to the Hilbert transform. 


Statements of Results 


Now we turn to a precise description of our main two weight theorem. 


Assumption: We fix once and for all a globally biLipschitz map Q : R” > R"” 
for use in all of our quasi-notions. 


We will prove a two weight inequality for standard qa-fractional Calderén- 
Zygmund operators T* in Euclidean space R”, where we assume the n-dimensional 
A conditions, new punctured A5 conditions, and certain w-quasienergy conditions 
as side conditions on the weights. In particular, we show that for positive locally 
finite Borel measures o and w in R”, and assuming that both the quasienergy 
condition and its dual hold, a strongly elliptic vector of standard o-fractional 
Calderén-Zygmund operators T” is bounded from L? (c) to L? (w) if and only if 
the A} condition and its dual hold (we assume a mild additional condition on the 
quasicubes for this), the punctured Muckenhoupt condition AS?" and its dual 
hold, the quasicube testing condition for T% and its dual hold, and the quasiweak 
boundedness property holds. In order to state our theorem precisely, we define these 
terms in the following subsections. 


Remark 2.1 It is possible to collect our various Muckenhoupt and quasienergy 
assumptions on the weight pair (o,@) into just two compact side conditions of 
Muckenhoupt and quasienergy type. We prefer however, to keep the individual 
conditions separate so that the interested reader can track their use below. 


Standard Fractional Singular Integrals and the Norm Inequality 


Let 0 < a@ < n. We define a standard qa-fractional CZ kernel K%(x, y) to be a 
function defined on R” x R” satisfying the following fractional size and smoothness 
conditions of order 1 + 6 for some 6 > 0, 
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|K* (x,y)| < Ccz|x—yl? and |VK® (x, y)| < Cez|x-yl2" |») 


5 
=) Ix—y|1, jee al 
ay 


4 -_ a (,! 
| Vk* (x,y) — VK (2 <Cen( |z—y| ~ 2” 


and the last inequality also holds for the adjoint kernel in which x and y are 
interchanged. We note that a more general definition of kernel has only order 
of smoothness 6 > 0, rather than 1 + 6, but the use of the Monotonicity and 
Energy Lemmas below, which involve first order Taylor approximations to the 
kernel functions K® (-, y), requires order of smoothness more than 1. 


Defining the Norm Inequality 


We now turn to a precise definition of the weighted norm inequality 
ITSfllizw < Me Wllaw, Fel). (6) 


For this we introduce a family {ng | of nonnegative functions on [0, co) 
"I 0<b<R<oo 


so that the truncated kernels Ky’, (x,y) = 1§ p (|x — y|) K® (x,y) are bounded with 
compact support for fixed x or y. Then the truncated operators 


Tys.rt (x) = [ Kir @ vf )doQ), xeR", 


are pointwise well-defined, and we will refer to the pair (Kz, {ng an ; ) 
: <d<R<oo 


as an a-fractional singular integral operator, which we typically denote by T%, 
suppressing the dependence on the truncations. 


Definition 2.2 We say that an a-fractional singular integral operator 


r= (a {ng F| ) satisfies the norm inequality (6) provided 


0<5<R<oo 
Tose low <M lle, FEL? ),0<8<R<oo. 


It turns out that, in the presence of Muckenhoupt conditions, the norm 
inequality (6) is essentially independent of the choice of truncations used, and 
we now explain this in some detail. A smooth truncation of T® has kernel 
s.r (|x — y|) K® (x,y) for a smooth function 7ns,p compactly supported in (6, R), 
0 <6 < R < o, and satisfying standard CZ estimates. A typical example of an 
a-fractional transform is the a-fractional Riesz vector of operators 


R@ =k" <leaten), Veaw<7 
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11,00 


The Riesz transforms R;“ are convolution fractional singular integrals R;“f = 
Ky" * f with odd kernel defined by 


£ 
an _ Ww _ Qe (w) 1 
ie pw fttime fy” w= (wi....w"), 


However, in dealing with energy considerations, and in particular in the Mono- 
tonicity Lemma below where first order Taylor approximations are made on the 
truncated kernels, it is necessary to use the tangent line truncation of the Riesz 
transform Rf” whose kernel is defined to be Q¢ (w) Wp (\w|) where Ws'p is 
continuously differentiable on an interval (0,5) with 0 < 56 < R < S, and where 
Vip lr) = "if 6 < r < R, and has constant derivative on both (0, 6) and (R, S) 
where ys‘, (S) = 0. Here S is uniquely determined by R and a. Finally we set 
Ws‘ p (0) = 0 as well, so that the kernel vanishes on the diagonal and common point 


masses do not ‘see’ each other. Note also that the tangent line extension of a C! 


function on the line is again C!* with no increase in the C!* norm. 

It was shown in the one dimensional case with no common point masses in [13], 
that boundedness of the Hilbert transform H with one set of appropriate truncations 
together with the A} condition without holes, is equivalent to boundedness of 
H with any other set of appropriate truncations. We need to extend this to R®” 
and more general operators in higher dimensions and to permit common point 
masses, so that we are free to use the tangent line truncations throughout the 
proof of our theorem. For this purpose, we note that the difference between the 
tangent line truncated kernel Q2¢ (w) W5‘p (|w|) and the corresponding cutoff kernel 


Qe (w) 1s.zj (JI) |w|* ” satisfies (since both kernels vanish at the origin) 
[Xe Ow) Wein Cll) = Qe Ow) Uo.ay Chol) wl | 


lore) lore) 
< be a {(2-*3)*™ tay (\w|)} +4 ye {(2*R)*" diwopaen (lw)} 


k=0 k=1 


lo) oo 
= » Qn) Kg (w) + > QO Kok (w), 


k=0 k=1 


a1 


n—at 


where the kernels K, (w) = 1{p,2»] (Jw) are easily seen to satisfy, uniformly in 
p, the norm inequality (12) with constant controlled by the offset AJ condition (7) 
below. The equivalence of the norm inequality for these two families of truncations 
now follows from the summability of the series )~?> 5 2-K—®) for 0 < a <n. The 
case of more general families of truncations and operators is similar. 


Quasicube Testing Conditions 


The following “dual’ quasicube testing conditions are necessary for the boundedness 
of T® from L? (c) to L? (@), 
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eye af |T* (Iga) |? » < 00, 
|O|, Q 


QEQP” 
1 a 
(TH)? = sup arf |(T°)* (1g@) |? o < 00, 
QEQpPp" wo YQ 


and where we interpret the right sides as holding uniformly over all tangent line 
truncations of T”. 


Remark 2.3 We alert the reader that the symbols Q, J, J, K will all be used to denote 
either cubes or quasicubes, and the context will make clear which is the case. 
Throughout most of the proof of the main theorem only quasicubes are considered. 


Quasiweak Boundedness Property 


The quasiweak boundedness property for T* with constant C is given by 


< WBP ra Vv |Ql., lQ'l. 


i. T° (lgc) dw 
Q 


1 
for all quasicubes Q, OQ’ with = < 


and either 0 C 30’ \ O' or O' C 30\ QO, 


and where we interpret the left side above as holding uniformly over all tangent line 
trucations of T*. Note that the quasiweak boundedness property is implied by either 
the tripled quasicube testing condition, 


||1s9T* (1g0) | es < oo Jollz)- for all quasicubes Q in R", 


or its dual defined with o and w interchanged and the dual operator T** in place 
of T°. In turn, the tripled quasicube testing condition can be obtained from the 
quasicube testing condition for the truncated weight pairs (, 1g0). 


Poisson Integrals and A; 


Let yz be a locally finite positive Borel measure on R”, and suppose Q is an Q2- 
quasicube in R”. Recall that iol =~ £(Q) for a quasicube Q. The two a-fractional 


Poisson integrals of 4 on a quasicube Q are given by: 
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P*(Q, ph) = i ie" FEE) 
B (lOl" + |x—xol) 
P* (O,u) = i lol" cme 


"\ (lal + x—sol) 


where we emphasize that |x — xg| denotes Euclidean distance between x and xg and 
|Q| denotes the Lebesgue measure of the quasicube Q. We refer to P® as the standard 
Poisson integral and to P* as the reproducing Poisson integral. 

We say that the pair K, K’ in P” are neighbours if K and K’ live in a common 
dyadic grid and both K Cc 3K’ \ K’ and K’ C 3K \ K, and we denote by N" the set 
of pairs (K, K’) in P” x P” that are neighbours. Let 


QN" = {(QK, QK’) : (K, K’) « N"} 


be the corresponding collection of quasineighbour pairs of quasicubes. Let o and 
@ be locally finite positive Borel measures on R”, possibly having common point 
masses, and suppose 0 < a < n. Then we define the classical offset Al constants by 


Dl Ola e 


AS (0,0) = aa 
(2.0)€2N" |Q|~" |Q" 


Since the cubes in P” are products of half open, half closed intervals [a, b), the 
neighbouring quasicubes (Q, Q’) € QN" are disjoint, and the common point masses 
of o and w do not simultaneously appear in each factor. 

We now define the one-tailed AS constant using P”. The energy constants E3""* 
introduced in the next subsection will use the standard Poisson integral P*. 


Definition 2.4 The one-tailed constants AS and A” for the weight pair (0, w) are 
given by 


AS = sup P* (O,1o-c) elo < 00, 
QEQP" |Q|"* 
ak |Ol, 
At* = sup P* (OQ, 1g) a 
bear ig\ 


Note that these definitions are the analogues of the corresponding conditions 
with ‘holes’ introduced by Hyténen [5] in dimension n = 1 — the supports of the 
measures 1g-o and 1gw in the definition of A} are disjoint, and so the common 
point masses of o and m do not appear simultaneously in each factor. Note also that, 
unlike in [28], where common point masses were not permitted, we can no longer 
assert the equivalence of A> with holes taken over guasicubes with A} with holes 
taken over cubes. 
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Punctured Ay Conditions 


Ila !Qlo fails to 


1-2 pH 1-2 
be finite when the measures o and w have a common point ae Seiply let Q 
in the sup above shrink to a common mass point. But there is a substitute that is 
quite similar in character that is motivated by the fact that for large quasicubes Q, 
the sup above is problematic only if just one of the measures is mostly a point mass 
when restricted to Q. The one-dimensional version of the condition we are about to 
describe arose in Conjecture 1.12 of Lacey [9], and it was pointed out in [6] that 
its necessity on the line follows from the proof of Proposition 2.1 in [16]. We now 
extend this condition to higher dimensions, where its necessity is more subtle. 

Given an at most countable set 8 = {p,}72, in R", a quasicube Q € QP", anda 
positive locally finite Borel measure ju, define 


As mentioned earlier, the classical Aj characteristic supgegg” 


KL (Q,P) = |Q|,, — sup {ue (px) : Pe E ONY}, 


where the supremum is actually achieved since ys peony (Px) < co as is locally 
finite. The quantity jz (Q, 3B) is simply the 7 measure of Q where J is the measure [u 
with its largest point mass from ‘¥ in Q removed. Given a locally finite measure pair 
(o, w), let Brow) = {pe}p2, be the at most countable set of common point masses 
of o and w. Then the weighted norm inequality (6) typically implies finiteness of 
the following punctured Muckenhoupt conditions: 


Ww (Q, Biow)) |OQl, 


Ae (c,) = sup me = 
gear" |Q|"  |Ql" 
mm (c,@) = |Q\., oy (O, Biow)) 


su = = 
gear" |Q|" |Q|" 


Lemma 2.5 Let T% be an a-fractional singular integral operator as above, and 
suppose that there is a positive constant Co such that 


AS (0, @) < CoMqu (0,0), 
for all pairs (o,@) of positive locally finite measures having no common point 
masses. Now let o and w be positive locally finite Borel measures on R" and let 
Bic) be the possibly nonempty set of common point masses. Then we have 


ae (c,@) + ae (0,) < AC) Ma (0,@). 


Proof Fix a quasicube Q € QP". Suppose first that Big) A Q = fdr, is finite 
with an even number of points. Choose kj € Noy = {1,2,...,2N} so that 


0 (Pr,) = 10 (px) - 
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Then choose ky € Noy \ {k1} such that 


@ (Pr) = max @ (px). 


KEN \{ki} 
Repeat this procedure so that 
o (Phon1) = max oO (px) ’ Kom-+1 € Non \ {ki, see kom} ’ 
kENoy \ {ki ,..-k2m } 
® (Phongr) = max (pe), Komt-2 © Now \ {kt . Komi} s 


kENoy\{ki...-kom4ti} 


for each m < N — 1. It is now clear that both 


N-1 N-1 


as o (Pri) = a(Q a Pow) ) and 20° © (Proi42) ea 


i=0 


[@ (ON Bow) — @ (Pi)] - 


See 
Nile 


In the case of an odd number 2N — 1 of common point masses, the second inequality 
will have N — 1 replaced with N — 2. 

Now, returning to the case of 2N common point masses, define new measures 0 
and @ by 


N-1 


N- 
-) ‘3 (Pix+2) Spb, 42 and @ = 1gw — YJ (Pioit1) Spy: 
i=0 i=0 


so that 


lQls = = 5(0 le and lQls aaa 5° (QO, Bow) 


Now © and @ have no common point masses and Sty« (a, w) is monotone in each 
measure separately, so we have 


© (2, Bow) |Ol, 
jar" jay 


< 4AS (&,O) < 4CoN ju (FO) < 4Cy Me (0, ). 


Thus AS?" (o,@) < 4C)9t4. (o,@) if the number of common point masses 
in Q is finite. A limiting argument proves the general case. The dual inequality 


A (0,@) < 4C)%qu (0, @) now follows upon interchanging the measures o 
and w. oO 


Now we turn to the definition of a quasiHaar basis of L? (i). 
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A Weighted QuasiHaar Basis 


We will use a construction of a quasiHaar basis in R” that is adapted to a measure ju 
(cf. [20] for the nonquasi case and [7] for the geometrically doubling quasi-metric 
space case). Given a dyadic quasicube Q € QD, where D is a dyadic grid of cubes 
from P", let AS denote orthogonal projection onto the finite dimensional subspace 
16 (2) of L? (2) that consists of linear combinations of the indicators of the children 
€ (Q) of Q that have z-mean zero over Q: 


Lg = sf= Y> aglg :ag eR, | fay =0 
O/ee(Q) Q 


Then we have the important telescoping property for dyadic quasicubes Q; C Q» 
(where [Q), 2) = {Q dyadic : Q; C Q S Qp}): 


lo (){ D> AGF) | =1e @ (EG,f-EGf), Q€e€(Q).fel (uw). 


Qe[Q1,02) 


(8) 
We will at times find it convenient to use a fixed orthonormal basis {10°} ver of 


16 (4) where T, = {0,1}” \ {1} is a convenient index set with 1 = (1,1,...,1). 


[a 
Then tho liek and QEQD 
that we add the constant function 1 if jz is a finite measure. In particular we have 


Be Fo} _ 3 (a i, 2 


acl, 


is an orthonormal basis for L” (1), with the understanding 


’ 


lau = > 1ASflaw = FO 
QEQD QEND 


where the measure is suppressed in the notation f. Indeed, this follows from (8) and 
Lebesgue’s differentiation theorem for quasicubes. We also record the following 
useful estimate. If J’ is any of the 2” QD-children of J, and a € T,,, then 


|S xf G2 


(9) 


The Strong Quasienergy Conditions 


Given a dyadic quasicube K € QD and a positive measure jt we define the 
quasiHaar projection Pk = >> AS on K by 


JEQD: JCK 
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2 
P= ieee end (Pele = DS ee, 


JEQD: JCK a€ly, JEQD: JCK a€ly, 


> 


and where a quasiHaar basis {ni ie EL, and JEQDQ adapted to the measure jz was 
defined in the subsubsection on a weighted quasiHaar basis above. 

Now we define various notions for quasicubes which are inherited from the same 
notions for cubes. The main objective here is to use the familiar notation that one 
uses for cubes, but now extended to (2-quasicubes. We have already introduced 
the notions of quasigrids QD, and center, sidelength and dyadic associated to 
quasicubes Q € Q°D, as well as quasiHaar functions, and we will continue to extend 
to quasicubes the additional familiar notions related to cubes as we come across 
them. We begin with the notion of deeply embedded. Fix a quasigrid QD. We say 
that a dyadic quasicube J is (r, ¢)-deeply embedded in a (not necessarily dyadic) 
quasicube K, which we write as J €,,, K, when J C K and both 


£(VJ) <2 E(k), (10) 


1 
qdist (J, 9K) = 5¢ ti. 


where we define the quasidistance qdist (E, F) between two sets E and F to be the 
Euclidean distance dist (Q7'E, Q7'F) between the preimages Q7'E and Q7'F of 


E and F under the map Q, and where we recall that @ (J) ~ |J| ”. For the most part 
we will consider J €;- K when J and K belong to a common quasigrid QD, but an 
exception is made when defining the strong energy constants below. 

Recall that in dimension n = 1, and for a = 0, the energy condition constant 
was defined by 


2, 
La)’ 


1 .(P* bey" 
& = sup 3 a =) |Pex 


I=UI, Il, r=1 


where / and J, are intervals in the real line, and U denotes a pairwise disjoint union. 
The extension to higher dimensions we use here is that of “strong quasienergy 
condition’ below. Later on, in the proof of the theorem, we will break down this 
strong quasienergy condition into various smaller quasienergy conditions, which 
are then used in different ways in the proof. 

We define a quasicube K (not necessarily in QD) to be an alternate QD- 
quasicube if it is a union of 2” QD-quasicubes K’ with side length £ (K’) = se (K) 
(such quasicubes were called shifted in [28], but that terminology conflicts with 
the more familiar notion of shifted quasigrid). Thus for any (2D-quasicube L there 
are exactly 2” alternate (2D-quasicubes of twice the side length that contain L, and 
one of them is of course the (2D-parent of L. We denote the collection of alternate 
QD-quasicubes by AQD. 
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The extension of the energy conditions to higher dimensions in [28] used the 
collection 


Mr,c—deep (K) = {maximal J Ey. K} 


of maximal (r,¢)-deeply embedded dyadic subquasicubes of a quasicube K (a 
subquasicube J of K is a dyadic subquasicube of K if J € QD when QD is a 
dyadic quasigrid containing K). This collection of dyadic subquasicubes of K is of 
course a pairwise disjoint decomposition of K.We also defined there a refinement 
and extension of the collection M (,,)—deep (K) for certain K and each £ > 1. For an 
alternate quasicube K € AQD, define M (,<)—deep,aD (K) to consist of the maximal 
r-deeply embedded (2D-dyadic subquasicubes J of K. (In the special case that K 
itself belongs to QD, then M (r,c)—deep, 2D (K) a M r,e)—deep (K).) Then in [28] for 
£ > 1 we defined the refinement (where 2K’ denotes the @” ancestor of K’ in the 
grid): 


Mi, e)—deep,2D (K) — {J € M (r,2)—deep,2D (x‘K’) for some K’ € Cop (K) ; 


J C Lfor some L € MG ¢)—deep (K)} : 


where €gp (K) is the obvious extension to alternate quasicubes of the set of QD- 
dyadic children. Thus Mi ates ae (K) is the union, over all quasichildren K’ of 
K, of those quasicubes in M (r,c)—deep («°K’ ) that happen to be contained in some 
L € Mc)—deep,aD (K). We then define the strong quasienergy condition as follows. 


Definition 2.6 Let 0 < a@ < n and fix parameters (r,¢). Suppose o and @ are 
positive Borel measures on R” possibly with common point masses. Then the strong 
quasienergy constant €""® is defined by’ 


(em zap up YY e 2) [PoP 


QD y= a n 
=u, | Mo [= 1JEeMye— deep Ur) lJ 


I P* (J, lio) 
+sup sup sup — > ——_—. | P?x el pay 
2D i1eAND >0 lo Mo 
JEM, 6) — —deep, ep) 


Similarly we have a dual version of E""°"® denoted E*°"**, and both depend 
on r and ¢ as well as on n and a. An important point in this definition is that the 
quasicube / in the second line is permitted to lie outside the quasigrid QD, but only 
as an alternate dyadic quasicube J € AQD. In the setting of quasicubes we continue 


; . : 2 : 
to use the linear function x in the final factor | Pex|| Dw) of each line, and not the 


’The first line in the display in Definition 5 in [29] is missing notation that is corrected here. 
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pushforward of x by Q. The reason of course is that this condition is used to capture 
the first order information in the Taylor expansion of a singular kernel. There is a 
logically weaker form of the quasienergy conditions that we discuss after stating 
our main theorem, but these refined quasienergy conditions are more complicated 
to state, and have as yet found no application - the strong energy conditions above 
suffice for use when one measure is compactly supported on a C!* curve as in [31]. 


Statement of the Theorems 


We can now state our main quasicube two weight theorem for general measures 
allowing common point masses, as well as our application to energy dispersed 
measures. Recall that Q : R” — R" is a globally biLipschitz map, and that QP” 
denotes the collection of all quasicubes in R” whose preimages under Q are usual 
cubes with sides parallel to the coordinate axes. Denote by QD C QP” a dyadic 


quasigrid in IR”. For the purpose of obtaining necessity of Aj for 5 < a@ <n, we 
adapt the notion of strong ellipticity from [30]. 
J 
Definition 2.7 Fix a globally biLipschitz map Q. Let T* = \T be a vector of 
j=l 
J 
singular integral operators with standard kernels {xe} . We say that T° is strongly 
= 


elliptic with respect to Q if for each m € {1, —1}", there is a sequence of coefficients 


J 
{art such that 
Tf jay 


J 
Sank? (xx + m)) > cf", teR, (11) 
j=l 


holds for a// unit vectors u in the quasi-n-ant QV,, (i.e. an n-dimensional quasi- 
quadrant) where 


Vin = {x € R": mx; > O for 1 <i <n}, me {1,-1}". 


Theorem 2.8 Suppose that T° is a standard a-fractional singular integral operator 
on R", and that w and o are positive Borel measures on R" (possibly having 
common point masses). Set Tf = T° (fo) for any smooth truncation of T?. Let 
Q : IR" > R" be a globally biLipschitz map. 


1. Suppose 0 < a <n. Then the operator T® is bounded from L? (o) to L? (w), i.e. 


To lliz@) SMe Willa » (12) 


uniformly in smooth truncations of T*, and moreover 
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Nr < Ca (V AS + AS ALE EASE PM L Try  Thy $f EMME EMTOMEM WEP) ; 


provided that the two dual A5 conditions and the two dual punctured Mucken- 
houpt conditions all hold, and the two dual quasitesting conditions for T” hold, 
the quasiweak boundedness property for T® holds for a sufficiently large constant 
C depending on the goodness parameter r, and provided that the two dual strong 
quasienergy conditions hold uniformly over all dyadic quasigrids OD C QP", 
ie, ESOP 4. ES*OME* < Oo, and where the goodness parameters ¥ and ¢ in the 
definition of the collections M (,2)—deep (K) and Mi tees ais (K) appearing in 
the strong energy conditions, are fixed sufficiently large and small respectively 
depending only on n and a. 


J 
2. Conversely, suppose 0 < a < nand that T® = \rs\ ; is a vector of Calderén- 
j= 


J 
Zygmund operators with standard kernels {Ke . In the range 0 < a < 5, we 


assume the ellipticity condition from ((30]): there is c > 0 such that for each unit 
vector u there is j satisfying 


[Ke (x,x+ tu)| >a", teR. (13) 


For the range 5 < & <_n, we assume the strong ellipticity condition in 


Definition 2.7 above. Furthermore, assume that each operator T;* is bounded 
from L? (c) to L? (a), 


< Nye Uf llize)- 


Lo) 


(a), 


Then the fractional Aj conditions (with ‘holes’) hold as well as the punctured 
Muckenhoupt conditions, and moreover, 


43 a. foe 4 Ae 4. Ae < Cote. 


Problem 1 Given any strongly elliptic vector T° of classical a-fractional 
Calderén-Zygmund operators, it is an open question whether or not the usual 
(quasi or not) energy conditions are necessary for boundedness of T®. See [27] for 
a failure of energy reversal in higher dimensions — such an energy reversal was 
used in dimension n = | to prove the necessity of the energy condition for the 
Hilbert transform, and also in [26] and [14] for the Riesz transforms and Cauchy 
transforms respectively when one of the measures is supported on a line, and in 
[31] when one of the measures is supported on a C' curve. 


Remark 2.9 If Definition 2.7 holds for some T®% and Q, then Q must be fairly tame, 
in particular the logarithmic spirals in Example 1.2 are ruled out! On the other hand, 
the vector of Riesz transforms R®” is easily seen to be strongly elliptic with respect 
to Q if Q satisfies the following sector separation property. Given a hyperplane H 
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and a perpendicular line L intersecting at point P, there exist spherical cones Sy 
and S$; intersecting only at the point P’ = Q(P), such that H’ = QH C Sy and 
L’= QL C S; and 


dist (x, 0Sq) ~ |x|, x € H and dist (x, 0Sz) ~ |x|, x € L. 


Examples of globally biLipshcitz maps Q that satisfy the sector separation property 
include finite compositions of maps of the form 


Q (x1,2") = (x12 + Wx), (x1,x’) eR", 


where w : R > R"! is a Lipschitz map with sufficiently small Lipschitz constant. 


In order to state our application to energy dispersed measures, we introduce some 
notation and a definition. Fix a globally biLipschitz map Q : R" > R". For0 <k < 
n—1, denote by L; the collection of all k-dimensional planes in R”. If in addition J 
is an (2-quasicube in R”, denote by M7? (J, jz) the ‘moments’ 


M? (VJ, u)? = inf i dist (x, L)* du (x), 
LeLi Jy 


and note that Mj (J, 1) is related to the energy E(J, 4) = ,/Ey 


wh J xd (x): 


\J|" 


n * 2: 2 
M3 On)? = fs Bys]? dyes) = VI, VEO, 


Clearly the moments decrease in k and we now give a name to various reversals of 
this decrease. 


Definition 2.10 Suppose ju is a locally finite Borel measure on R”, and let k be an 
integer satisfying 0 < k < n— 1. We say that py is k-energy dispersed if there is a 
positive constant C = C;,, such that for all Q-quasicubes J, 


Mi (J, 1) < CM? (J, 1). 


If both o and @ are appropriately energy dispersed relative to the order 
0 <a <n, then the T1 theorem holds for the a-fractional Riesz vector transform 
R°”. 


Theorem 2.11 Let0 <a <nand0<k<n-—1 sastisfy 


n-k<a<n,aA#An-1 if l<k<n-2 
O<a<n,a4¢l,n-1 if k=n-1 
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Suppose that R*” is the a-fractional Riesz vector transform on RR", and that w and o 
are k-energy dispersed locally finite positive Borel measures on IR" (possibly having 
common point masses). Set RE"f = R®" (fo) for any smooth truncation of R%". 
Let Q : R" > R" be a globally biLipschitz map. Then the operator R®&" is bounded 
from L? (a) to L? (a), i.e. 


IRoflliz@) < Mere If lle » 


uniformly in smooth truncations of R®”, if and only if the Muckenhoupt conditions 
hold, the testing conditions hold and the weak boundedness property holds. 
Moreover, we have the equivalence 


Maen wy (A ae ae Oe 4. Arne Sema de Thon + WBP pen 


The case k = n — 1 of k-energy dispersed is similar to the notion of uniformly 
full dimension introduced by Lacey and Wick in [18, versions 2 and 3]. The proof of 
Theorem 2.11 shows that we can also take w and o to be k; and kz energy dispersed 
respectively, provided a satisfies the hypotheses with respect to both k, and kp. 


Proof of Theorem 2.8 


We now give the proof of Theorem 2.8 in the following sections. Sections “Mono- 
tonicity Lemma and Energy Lemma”, “Corona Decompositions and Splittings” and 
“Energy Dispersed Measures” are largely taken verbatim from the corresponding 
sections of [28], but are included here since their omission here would hinder the 
readability of an already complicated argument. 


Good Quasicubes and Energy Muckenhoupt Conditions 


First we extend the notion of goodness to quasicubes. 


Definition 3.1 Let r € N and 0 < e < 1. Fixaquasigrid QD. A dyadic quasicube J 
is (r, €)-good, or simply good, if for every dyadic superquasicube /, it is the case that 
either J has side length greater than 2 times that of J, or J EG, J is (1, €)-deeply 
embedded in /. 


Note that this definition simply asserts that a dyadic quasicube J = QJ’ is (r, €)- 
good if and only if the cube J’ is (r, ¢)-good in the usual sense. Finally, we say that 
J is r-nearby in K when J C K and 


£(J) > 27*L(K). 
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The parameters r,¢ will be fixed sufficiently large and small respectively later in 
the proof, and we denote the set of such good dyadic quasicubes by 2D ¢2)—gooa; 
or simply Q2Dgooq When the goodness parameters (r, €) are understood. Note that 
if J! € QDe.)—gooa and if J’ C K € QD, then either J’ is r-nearby in K or 
JI CcJ&. K. 

Throughout the proof, it will be convenient to also consider pairs of quasicubes 
J,K where J is (p, £)-deeply embedded in K, written J €,,. K and meaning (10) 
holds with the same ¢ > 0 but with p in place of r; as well as pairs of quasicubes 
J, K where J is p-nearby in K, £ (J) > 27~°£ (K), for a parameter p > r that will be 
fixed later. 


Notation 2. We will typically use the side length € (J) of a Q-quasicube when we 
are describing collections of quasicubes, and when we want ¢ (J) to be a dyadic or 


related number; while we will typically use |J | in estimates, and when we want to 
compare powers of volumes of quasicubes. We will continue to use the prefix ‘quasi’ 
when discussing quasicubes, quasiHaar, quasienergy and quasidistance in the text, 
but will not use the prefix ‘quasi’ when discussing other notions. In particular, 
since quasiAS + quasiAyh"™" ~ AS + ASP" (see e.g. [31] for a proof) we do 
not use quasi as a prefix for the Muckenhoupt conditions, even though quasiA}5 
alone is not comparable to AS. Finally, we will not modify any mathematical 
symbols to reflect quasinotions, except for using QD to denote a quasigrid, and 
qdist (FE, F) = dist (Q7'E, Q7'F) to denote quasidistance between sets E and F, 
and using |x — Ylqaist = |Q7"x - Q71y| to denote quasidistance between points x 
and y. This limited use of quasi in the text serves mainly to remind the reader we are 
working entirely in the ‘quasiworld’. 


Energy Muckenhoupt Conditions 

We now show that the punctured Muckenhoupt conditions Pa and a 
control respectively the ‘energy A} conditions’, denoted AC and Aan 
where 


Pts 

Hs QQ) | 72 

sg 0) ap, th Ol, 
QEQP" | Q| n | Q| n 


2 


(14) 


po _x_ 

ial, [Pearl 
1—2 1-2 

genP" |Ql" [QI " 


Faas (o, o) = 


These energy A¥ conditions play a critical role in controlling local parts of functional 
energy later in the paper, and it is a crucial requirement that they are necessary 
conditions, as shown by the next lemma. 
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Lemma 3.2 For any positive locally finite Borel measures 0, w we have 


AS") (6, m) < max {n, 3} AS" (G,@), 


ee (0, o) < max {n, 3} Ag* Punct (0, @) ; 


Proof Fix a quasicube Q € QD. If w (OQ, Peale 5 |Q|,,, then we trivially have 


2 
Pe 
[Pez lia) Ql <n 2lo_ Ole 
jar-*  jai-* ~ “jai* jg 
ole: Pe w)) ele < onary (0, @) } 
|Q| ele 


On the other hand, if w (Q, Bow) < 
such that 


5 |Q|,, then there is a point p € ON Biow) 


1 
© ((P}) > 5 1Qle 


and consequently, p is the largest w-point mass in Q. Thus if we define @ = w — 
w ({p}) 6,, then we have 


w (Q, Biow)) = lQls. 
Now we observe from the construction of Haar projections that 
A® = A’, forall J € QDwithp ¢ J. 


So for each s > 0 there is a unique quasicube J, € QD with € (J;) = 27°£ (Q) that 


contains the point p. For this quasicube we have, if {np “} yeav. acer, 1S 4 basis for 
L’ (o), 
2: 2 2 
|aex Vo) = Ss (ng), | = oe (hy. x— Ph, 
acl, acl, 
2 2 
=| f m@e-maow) =O] f mw @o-naoo 
acl, Js acl, Js 
<A Tw Mb Pew SL Mee Ne Ms OOP 
acl, acl, 


< n2"€ 5) slo < 277°C (0) lOlz- 


Thus we can estimate 
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x |’ 1 2 
pe _—_ = —_. Ax 
| °¢(Q) L?(a) £(0) 7 | z Ii.) 
1 ~ 12 — 2 
=a) D4 9sheem + AR lee 
£{Oy JEQD: ptICO o =0 =“ 
1 a2 = —2s 2 
s (oy: (aes 7 27 £(Q)°|Q :) 
1 [oe) 
< —; (|) l@ls+ 2-7 (0? 12s) 
£(Qy ( 2 
< 3|Qlz < 30 (2, Brow) « 
and so 
Pe x i 
Peat 20) 1Qlo_ - 30 (2 Boo) [Cle < 44200 (gg) 
ja jaime ~ gine jafinn 


Now take the supremum over Q € QD to obtain AS“""*” (6, @) < max {n, 3} Ay" 
(o, w). The dual inequality follows upon interchanging the measureso andw. O 


Plugged A$*"""?"2 Conditions 


Using Lemma 3.2 we can control the ‘plugged’ energy A; conditions: 


wm _ xX 
| QUO) | 72 
Agee PME (9,0) = sup —— (0,0), 
gear = Qh 
| ox 
2 L(Q) || p2 
Ay @P8 (9,@) = sup PY (Q,0) —— 
QEQP” |O|"* 


Lemma 3.3 We have 


Aa (o, @) < A (o, o) A. ae (o, @) 


AT eMeePh® (gw) S AZ" (0,0) + AS (6,0). 
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Proof We have 


Pests, |Pszas\, [Peas 
- (O,o) = 1) pa (Oo 1 ) ope (Q 1 a) 
lo" ia jo 
Niger 7 
£(Q) 
< les (0.tge) + Pele 10h, 
|O|* Ol" = =|Ql" 


S AS (o,@) + Ay (0,0). 


Random Grids and Shifted Grids 


Using the analogue for dyadic quasigrids of the good random grids of Nazarov, Treil 
and Volberg, a standard argument of NTV, see e.g. [35], reduces the two weight 
inequality (12) for T* to proving boundedness of a bilinear form 7° (f, g) with 
uniform constants over dyadic quasigrids, and where the quasiHaar supports suppf 
and suppg of the functions f and g are contained in the collection QD®°™ of good 
quasicubes, whose children are all good as well, with goodness parameters r < 00 
and ¢ > O chosen sufficiently large and small respectively depending only on n and 
a. Here the quasiHaar support of f is suppf = {7 EQD: AIF A o}, and similarly 
for g. In fact we can assume even more, namely that the quasiHaar supports suppf 
and suppg of f and g are contained in the collection of t-good quasicubes 
QD = {K € QD: Cx C QD ee)—gooa and WHHK € LD(r,¢\—gooa for allO << t}, 
(15) 


ie 
(r,¢)—good 


that are (r, €)-good and whose children are also (r, ¢)-good, and whose £-parents 
up to level t are also (r, €)-good. Here t > r is a parameter to be fixed later. We 
may assume this restriction on the quasiHaar supports of f and g by the following 
lemma. See [29] for a proof.® 


Lemma 3.4 Givenr > 3, t > 1 andi <e<1-—14, we have 


QD (r—1,3)—good Cc QD? 


(r,e)—good’ 
provided 
-1 
0<8< — , (16) 
T 


8This lemma is misstated in [30]. 
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For convenience in notation we will sometimes suppress the dependence on a 
in our nonlinear forms, but will retain it in the operators, Poisson integrals and 
constants. More precisely, let QD° = QD® be an (r, €)-good quasigrid on R", 


O,a 
and let thy Feat acl, 


as described above, so that 


f=) AS SS Pere 3 FaOR, 


and {ny be corresponding quasiHaar bases 
JEQD®, bELy, 


IEQD? IEQD?, acl, IEQD°?, acl, 
_ oO, _ a,b ob _ ~~]. @,b 
go eS See DS eee. 
JEQD® JEQD®, bET, JEQD®, bel, 


where the appropriate measure is understood in the notation f (I ;a) and (J; b), and 
where these quasiHaar coefficients f (I; a) and (J; b) vanish if the quasicubes J and 
J are not good. Inequality (12) is equivalent to boundedness of the bilinear form 


PR VM=(MAN.%.= Yd — (Te(APf). AZ), 


TEQD® and JEQD® 


on L’ (a) x L? (a), ie. 


IPF. 8)| S Mee [lll Mlli2) » (17) 


uniformly over all quasigrids and appropriate truncations. We may assume the two 
quasigrids QD? and QD*® are equal here, and this we will do throughout the paper, 
although we sometimes continue to use the measure as a superscript on QD for 
clarity of exposition. Roughly speaking, we analyze the form 7* (f, g) by splitting 
it in a nonlinear way into three main pieces, following in part the approach in [12] 
and [13]. The first piece consists of quasicubes J and J that are either disjoint or of 
comparable side length, and this piece is handled using the section on preliminaries 
of NTV type. The second piece consists of quasicubes J and J that overlap, but are 
‘far apart’ in a nonlinear way, and this piece is handled using the sections on the 
Intertwining Proposition and the control of the functional quasienergy condition by 
the quasienergy condition. Finally, the remaining local piece where the overlapping 
quasicubes are ‘close’ is handled by generalizing methods of NTV as in [11], and 
then splitting the stopping form into two sublinear stopping forms, one of which 
is handled using techniques of [16], and the other using the stopping time and 
recursion of M. Lacey [8]. See the schematic diagram in section “Doubly Iterated 
Coronas and the NTV Quasicube Size Splitting” below. 

We summarize our assumptions on the Haar supports of f and g, and on the 
dyadic quasigrids QD. 
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Condition 1 (on Haar supports and quasigrids) We suppose the quasiHaar 
supports of the functions f and g satisfy suppf,suppg C QDE,. «)—good" We also 


assume that |0Q|,4,, = 0 for all dyadic quasicubes Q in the grids QD (since this 
property holds with probability 1 for random grids QD). 


Necessity of the A; Conditions 


Here we consider in particular the necessity of the fractional AJ condition (with 
holes) when 0 < a < n, for the boundedness from L? (a) to L? (w) (where o and 
@ may have common point masses) of the a-fractional Riesz vector transform R® 
defined by 


g=¥ 


ae ? 


Roy) = [KM OIdoO), KF (Hy) = 


|x — 


whose kernel K;* (x, y) satisfies (5) for 0 < a < n. More generally, necessity holds 
for elliptic operators as in the next lemma. See [30] for the easier proof in the case 
without holes. 


Lemma 4.1 Suppose 0 < a <n. Let T® be any collection of operators with a- 
standard fractional kernel satisfying the ellipticity condition (13), and in the case 
y <a <n, we also assume the more restrictive condition (11). Then for0 <a <n 
we have 


At < My (T*). 


Proof First we give the proof for the case when 7% is the a-fractional Riesz 


transform R®, whose kernel is K® (x,y) = ——=4— . Define the 2” generalized 


lx—yl 


n-ants Q,, form € {—1, 1}”, and their translates Q,, (w) for w € R” by 
Om = {1,-0.6 5Xn) xX > O}~, An (w) = {z:z-wEe Qa}, weR’. 
Fix m € {—1, 1}” and a quasicube J. For a € R" andr > 0 let 


£() 


TO + pew’ 17 9) = Fe-ntarsio.y 0) 81 OY" 


87 (x) = 


where ¢, is the center of the cube J. Now 


LD) |x-y| SLM |x- S| + £C [r —y| S [EW + |x - SIE + 1% — Il 
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implies 


1 


re |= = Ts). x,yeER’. 


Now the key observation is that with LC = m- ¢, we have 


L(x—y)=m-(x—y)2 |x-yl, x€ Qn), 


which yields 


ey) tL" 5, (0) 5, OY", (18) 


L(K* (x, y)) = aie 
jx—y|"*" Ix—y| 


provided x € Q,,(y). Now we note that x € Q,,(y) when x € Q,, (a) and y € 
OQ- (a) to obtain that for x € Q,, (a), 


L(x—y) 


L(T* (fara) 0) = f —)) 5 (y) do) 


Q--m(a)NB(O,r) |x — y| 


> LN" 5; @)" / 51)" do (y). 


Q-m(@NB(0,r) 


Applying |L¢| < ./n|¢| and our assumed two weight inequality for the fractional 
Riesz transform, we see that for r > 0 large, 


2 
08 eis / vac) midis ( i sy (y)*"- da 0») do (x) 
Qn(a) Q-m(a)NBO,r) 


$Me Moy S Pe RY Walling = FRY fs Odo 0). 


Rearranging the last inequality, and upon letting r — oo, we obtain 


ey ep * a2 
dw (x d < MMe (R®)°. 
i (€ (1) + |x — Gr)" os ae (€() + ly — &))*"-# Te” 


Note that the ranges of integration above are pairs of opposing n-ants. 
Fix a quasicube Q, which without loss of generality can be taken to be centered 
at the origin, €g = 0. Then choose a = (2¢ (Q) ,2£(Q)) and J = Q so that we have 


£(Q)"* ( Pa ) 
———_, 4 x £ d 
(= (€(O) + |x|)2"-2 ow ( ) (Q) [ o 


£(Q)"— o £(Q)"— a st 
ie SMe (R*)’. 
7 eo €@) + |x) 7a 0) J. 2 m(a) (€(Q) + ly)" 7a 40 0) &) 
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Now fix m = (1,1,..., 1) and note that there is a fixed N (independent of £ (Q)) and 
a fixed collection of rotations {, an such that the rotates ppQn (a), 1 < k < N, 
of the n-ant Q,,, (a) cover the complement of the ball B (0, 4,/n€ (Q)): 


N 
B(0,4/n€ (Q))° CL) ex Qn (a). 


k=1 


Then we obtain, upon applying the same argument to these rotated pairs of n-ants, 


e(gy"* ( _ ) os 
: é do) SM, (R*). (19 
— (€(Q) + |x|)" v0] (2) i o) SMe (R"). (19) 


Now we assume for the moment the offset A condition 


vo (L 0) ([i0) <0 


where Q’ and Q are neighbouring quasicubes, i.e. (Q’,Q) € QN". If we use this 
offset inequality with Q’ ranging over 3Q \ Q, and then use the separation of 
B (0, 4./né (Q)) \ 3Q and Q to obtain the inequality 


Lior d ( do) SAS 
a (i ») l ‘ 


together with (19), we obtain 


Nie 


e(Qy"* ) 2 ( ro ) “ : 
d L d < Ny (R Ag. 
cn CO + pre) EO [ o) Ry ase 


Clearly we can reverse the roles of the measures w and o and obtain 


VJ AX* S My (R®) + AZ 


for the kernels K*®, 0 < a <n. 

More generally, to obtain the case when T° is elliptic and the offset AJ condition 
holds, we note that the key estimate (18) above extends to the kernel = Anke 
of 4 Aj'T;' in (11) if the n-ants above are replaced by thin cones of sufficently 
small aperture, and there is in addition sufficient separation between opposing cones, 
which in turn may require a larger constant than 4./n in the choice of Q’ above. 
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Finally, we turn to showing that the offset AJ condition is implied by the norm 
inequality, i.e. 


Nie 


1 af | 
AS = £(Q)* d —|d < MN, (R*); 
eet tas (ae ») Cat o) SMa 
: / i 2 a2) 2-24 ) 
i.e. do ee SM (RV Ol, (A, OQ) EQN". 


In the range 0 < a@ < 5 where we only assume (13), we adapt a corresponding 


argument from [15]. 
The ‘one weight’ argument on page 211 of Stein [34] yields the asymmetric two 
weight A> condition 


]O'|, IOl, < CMe (R®) |OPO-") , (20) 


where Q and Q’ are quasicubes of equal side length r and distance Cor apart for 
some (fixed large) positive constant Co (for this argument we choose the unit vector 
u in (13) to point in the direction from Q to Q’). In the one weight case treated in [34] 
it is easy to obtain from this (even for a single direction u) the usual (symmetric) A, 
condition. Here we will have to employ a different approach. 

Now recall (see Sec 2 of [24] for the case of usual cubes, and the case of half 
open, half closed quasicubes here is no different) that given an open subset ® of 
IR”, we can choose R > 3 sufficiently large, depending only on the dimension, such 


that if jor are the dyadic quasicubes maximal among those dyadic quasicubes Q 
Jj 
satisfying RQ C 9, then the following properties hold: 


(disjoint cover) ® = U; Q;andQ;N OQ; = Wifi Fj 
(Whitney condition) RO; C ® and 3RQ;M O° ¥ O for all; . (21) 
(finite overlap) >) 430; < Cx 


So fix a pair of neighbouring quasicubes (Q). Qo) € QN", and let {Q;}; be a 
Whitney decomposition into quasicubes of the set = (Q5 x Qo) \ ® relative to 
the diagonal D in R” x R”. Of course, there are no common point masses of w in Qj 
and o in Qo since the quasicubes Q% and Qp are disjoint. Note that if Q; = Q/ x Q;, 
then (20) can be written 

Qiao << CMa (R°) |Qi'™ (22) 
where w x o denotes product measure on R” x R”. We choose R sufficiently large in 
the Whitney decomposition (21), depending on Co, such that (22) holds for all the 
Whitney quasicubes Q;. We have )~, |Q;| = |Q’ x Q| = |Q|’. 
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Moreover, if R = Q’ x Q is a rectangle in R” x R” (i.e. Q’, Q are quasicubes in 
IR”), and if R = U,R; is a finite disjoint union of rectangles R,, then by additivity 
of the product measure w x o, 


Pleo = So UR ilu. « 


Let Qo = Qj x Qo and set 
A= {Q=0'xQ0:QC Qo, £(Q) = £(O’) © Cy 'qdist (Q, O’) and (20) holds} . 


Divide Qo into 2n x 2n = 4n? congruent subquasicubes Qi: er OF of side length 
5, and set aside those G, € A (those for which (20) holds) into a collection of 
stopping cubes I’. Continue to divide the remaining a € A of side length i and 


again, set aside those ely € ® into I, and continue subdividing those that remain. 
We continue with such subdivisions for N generations so that all the cubes not set 
aside into I have side length 2~" . The important property these latter cubes have is 
that they all lie within distance r2~" of the diagonal D = {(x, x) : (x,x) € Qj x Qo} 
in Qo = Qj x Qo since (20) holds for all pairs of cubes Q’ and Q of equal side 
length r having distance at least Cor apart. Enumerate the cubes in I as {Q}}; 
and those remaining that are not in I as { Pie Thus we have the pairwise disjoint 
decomposition 


Qo = (U a 'e 'o P; 
i J 
The countable additivity of the product measure w x o shows that 
IQolioxe = > |Qiloxca + >, |Pilessce 
i J 


Now we have 
DY IQiloxe S >) Me (RQ, 


and 


a k ro = 
be IQ," = > b> (22k)! x > (; an) (228)! " (Whitney) 


kEZ: 2 <l(Qo) i: (Q)=2* kEZ: 2k <l(Qo) 


= CQ)" YY WAHT) < Cyl Qo)" CQ)" = Cu 1Q0 x ol? = Cu 1Qol'™* , 


kEZ: 2k <£(Qo) 
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provided 0 < a < 4. Using that the side length of P; = P; x P; is 2~-% and 
dist (P},D) < C,2-%, we have the following limit, 


So lPiloxe =|LJPi] > 0asN > c, 
j j 


Oxo 


since 'e P; shrinks to the empty set as N — oo, and since locally finite measures 
j 

such as w X o are regular in Euclidean space. This completes the proof that ,/A3 < 

Ny (R°) for the range 0 < a < 5. 

Now we turn to proving Jaz S M%, (R*) for the range 5 < a < n, where 
we assume the stronger ellipticity condition (11). So fix a pair of neighbouring 
quasicubes (K’, K) € QN", and assume that 0 + w doesn’t charge the intersection 
K' 1 K of the closures of K’ and K. It will be convenient to replace n by n + 1, ie 
to introduce an additional dimension, and work with the preimages Q’ = Q7 IK! 
and Q = Q7'K that are usual cubes, and with the corresponding pullbacks 
@O = m X Q*w ando = m, x Q*o of the measures w and o where m, is Lebesgue 
measure on the line. We may also assume that 


=(-1,0)x[]Q@, @=(0,1)x[]a.. 
i=1 i=1 


where QO; = [a;,b;] for 1 < i < n (since the other cases are handled in similar 
fashion). It is important to note that we are considering the intervals Q; here to be 
closed, and we will track this difference as we proceed. 

Choose 9; € [a,, bi] so that both 


1 
—1,0) x [@, bilx J] o: 25 Os 


- 
oO i=2 


w 


I-10 x [a1, O71] x [ [2 


i=2 


Now denote the two intervals [a;, 6;] and [9;, b:] by [ay, bf | and [ay*, b}*] where 
the order is chosen so that 


[0, 1) x [afb x[o] = [0, 1) x [ or oi] TT 


Then we have both 


a and |[0, 1) x [ay*, be*] x Me 


1 
5 | Ole - 


Ns 
[ox ay. bt | To 512 


= Zz 
oO oc 
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Now choose 6) € [d, b2] so that both 


> lig 
= 4 @? 


oO 


[- 1,0) x [at [az, 62] x He) 


—1,0) x [a¥, bY] x [62.2] x [] 2: 


i=3 


and denote the two intervals [a2, 02] and [62, bz] by [as, b¥ | and Ew ps” | where 
the order is chosen so that 


[0, 1) x |[ay*, bf*] x [a5, BF] «To, = (0,2) 3< [aj* by] [ag* be" | Ix[To} 


o 


Then we have both 


> — |e 


w 


Lo x [aisb) | x [ay,b5| x Tle 


i=3 


(0, 1) x ae ad be lasso “ad |x I 2 


and continuing in this way we end up with two rectangles, 


= [—1,0) x [at, by] x [az, b5] x ee 
H = (0,1) x [ay*, by*| x ae x iia [ax* b*], 


norn 


that satisfy 


IGla = |[-1.0) x [at bf] x [a b3] x... x [at be] |. = 


Bie = |x [ater |< [a* oF | x. [a 


n? 


However, the quasirectangles QG and Q4H lie in opposing quasi-n-ants at the 
vertex Q6 = Q (6), 02,...,6,), and so we can apply (11) to obtain that for x € QG, 


J 
Do AarTe one) (x) 


j=l 


a) |x — yl" do (y) 2 |RO|"—! |QH|, « 
QH 


J 
=|) oapKs do 0) 
QH 5 


For the inequality above, we need to know that the distinguished point Q6 is not a 
common point mass of o and a, but this follows from our assumption that 0 + w 
doesn’t charge the intersection K’ M K of the closures of K’ and K. Then from the 
norm inequality we get 
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2 
J 


2 2 Mpa 
|QG|., (I2a)" '|QHI,) sf Sans (lguo)| dw 


j=l 


as 16 ,4do = NK QH|,. 
~w si ante iy QH yas anTe | lo 


from which we deduce that 


I2ePG-) |Qo'|, |Qa|, < 2"2ePG-Y jag}, |QH|, <2", 
j= 


mara? 
cy 


2(4=1) | pr < 72ngy2 
|K| |K’ ls |K|, NO 2 TS amr 


and hence 


a Src 

Thus we have obtained the offset AJ condition for pairs (K’,K) € QN" such 
that o + w doesn’t charge the intersection K’ M K of the closures of K’ and K. 
From this and the argument at the beginning of this proof, we obtain the one-tailed 
A conditions. Indeed, we note that |d (rQ)|,.,, > 0 for only a countable number 
of dilates r > 1, and so a limiting argument applies. This completes the proof of 
Lemma 4.1. Oo 


Monotonicity Lemma and Energy Lemma 


The Monotonicity Lemma below will be used to prove the Energy Lemma, which 
is then used in several places in the proof of Theorem 2.8. The formulation of the 
Monotonicity Lemma with m = 2 for cubes is due to M. Lacey and B. Wick [18], 
and corrects that used in early versions of our paper [28]. 


The Monotonicity Lemma 


For 0 < a <nandm € R4, we recall the m-weighted fractional Poisson integral 


, i" 
P* (J) = i —__dus(y), 


n i 
B (|dI" + ly— cal) 


where Pf (J, 4) = P® (J, jz) is the standard Poisson integral. The next lemma holds 
for quasicubes and common point masses with the same proof as in [30]. 
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Lemma 5.1 (Monotonicity) Suppose that I and J are quasicubes in IR" such that 
J Cc 2J CI, and that yt is a signed measure on R" supported outside I. Finally 
suppose that T* is a standard a-fractional singular integral on R" withO <a <n. 
Then we have the estimate 


AGT? L124) SO J, lH), (23) 


~N 


where for a positive measure Vv, 


2 2 
Pe ,v) 2 Pig Vv) 
a Doss ’ o 1+6 2 
= (=e |APx[ re) + (met IIx— my|lp24,0) » 


The Energy Lemma 


Suppose now we are given a subset H of the dyadic quasigrid QD®. Let 
P? = Yyex AP be the corresponding w-quasiHaar projection. We define 
H* = UJ {J’ € QD® : J’ C Jt. The next lemma also holds for quasicubes and 
SEL 

common point masses with the same proof as in [30]. 

Lemma 5.2 (Energy Lemma) Let J be a quasicube in QD®. Let V; be an L? (w) 
function supported in J and with w-integral zero, and denote its quasiHaar support 
by H = suppW, = \K EQD®: W; (K) x of. Let v be a positive measure 
supported in R" \ yJ with y > 2, and for each J! EH, let vp = gyv with |gy| < 1. 
Let T® be a standard a-fractional singular integral operator with 0 < a <n. Then 
with 6! = 3 we have 


> (e (vy) 3 AGW), 


JVEH 


1 PUL (J, v) 
SP a lle op ae 
L( ) 8 MG | H 


P* (J, v) 3 
S IMallee) (“e") Pe ce 


OL 
S alle) (=2) Pex) 


D(a) 


L?(w)’ 


1 
n 
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and in particular the ‘pivotal’ bound 


[(7" (v) , Badol S CMMs lle) P* FY) Vo: 


Remark 5.3 The first term on the right side of the energy inequality above is the 
‘big’ Poisson integral P® times the ‘small’ energy term | Pal Foe that is additive 
in H, while the second term on the right is the ‘small’ Poisson integral Pf, ;, times 
the ‘big’ energy term | Pox | 2) that is no longer additive in H. The first term 
presents no problems in subsequent analysis due solely to the additivity of the 
‘small’ energy term. It is the second term that must be handled by special methods. 
For example, in the Intertwining Proposition below, the interaction of the singular 
integral occurs with a pair of quasicubes J C J at highly separated levels, where the 
goodness of J can exploit the decay 5’ in the kernel of the ‘small’ Poisson integral 
P{, » relative to the “big’ Poisson integral P“, and results in a bound directly by the 
quasienergy condition. On the other hand, in the local recursion of M. Lacey at the 
end of the paper, the separation of levels in the pairs J C I can be as little as a fixed 
parameter p, and here we must first separate the stopping form into two sublinear 
forms that involve the two estimates respectively. The form corresponding to the 
smaller Poisson integral P{, ;, is again handled using goodness and the decay 6’ in 
the kernel, while the form corresponding to the larger Poisson integral P” requires 
the stopping time and recursion argument of M. Lacey. 


Preliminaries of NTV Type 


An important reduction of our theorem is delivered by the following two lemmas, 
the first of which is due to Nazarov, Treil and Volberg in the case of one dimension 
(see [21] and [35]), and the second of which is a bilinear Carleson embedding. 
The proofs given there do not extend in standard ways to higher dimensions with 
common point masses, and we use the quasiweak boundedness property to handle 
the case of touching quasicubes, and an application of Schur’s Lemma to handle 
the case of separated quasicubes. The first lemma below is Lemmas 8.1 and 8.7 in 
[18] but with the larger constant Aj there in place of the smaller constant AS here. 
We emphasize that only the offset AJ condition is needed with testing and weak 
boundedness in these preliminary estimates. 


Lemma 6.1 Suppose T% is a standard fractional singular integral with 0 < a <n, 
and that all of the quasicubes I € QD? , J € QD® below are good with goodness 
parameters € and r. Fix a positive integer p > r. For f € L? (a) and g € L? (w) we 
have 


> (7 (ATA) APs), S (Sa + TE + WEP re + VAS) liflli2cy lsll2c) 
(NEQD? XQD® 
2 PL(T)<L) <2? LL) 
(24) 
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and 


(7S (ATA) AF8),1 S VAE Mlle llgllawy + (25) 
(LJ EQD? xQD" 


= LV) = 
INJ=@ and un #2 P| 


where the implied constants depend only on n, a and T°. 


Lemma 6.2 Suppose T% is a standard fractional singular integral with 0 < a <n, 
that all of the quasicubes I € QD°,J € QD® below are good, that p > Yr, that 
f € LV’ (0) and g € L’ (@), that F C QD? and G C QD” are o-Carleson and 
w-Carleson collections respectively, i.e., 


> lFiSlFle FeFand YS) (Gl, S|Gy. GEG, 
FleF. F’CF GEG: GCG 


that there are numerical sequences {a (F)} rex and {Bg (G)}geg such that 


Yo ar (FY Fle < Willie) and D5 Bo (GIGI, < IIgligg), (26) 
FEF GEG 


and finally that for each pair of quasicubes (I, J) € QD® x QD”, there are bounded 
functions B;z and y; supported in I \ 2J and J \ 2I respectively, satisfying 


Brslhoo sWYralloo S 1. 


Then 
a (7% (Brslrox (F)) , A?e),| aT » (7% (AZP) . v1.616Be (G)),,| 
(F.NEFXQD® (,G)END?xG 
FOI=6 and l(1)<27P &(F) ING=O and U1) <2- G6) 
Ss VAE Illi IIglle2() - (27) 


See [29] for complete details of the proofs when common point masses are 
permitted. 


Remark 6.3 If F and G are o-Carleson and w-Carleson collections respectively, 
and ifar(F) = Ef|f| and Bg (G) = E@|g|, then the ‘quasi’ orthogonality 
condition (26) holds (here ‘quasi’ has a different meaning than quasi), and this 
special case of Lemma 6.2 serves as a basic example. 


Remark 6.4 Lemmas 6.1 and 6.2 differ mainly in that an orthogonal collection of 
quasiHaar projections is replaced by a ‘quasi’ orthogonal collection of indicators 
{lra#z (F)} <7. More precisely, the main difference between (25) and (27) is that a 
quasiHaar projection A’ or A? g has been replaced with a constant multiple of an 
indicator 1a (F) or 1gBg (G), and in addition, a bounded function is permitted to 
multiply the indicator of the quasicube having larger sidelength. 
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Corona Decompositions and Splittings 


We will use two different corona constructions, namely a Calderén-Zygmund 
decomposition and an energy decomposition of NTV type, to reduce matters to the 
stopping form, the main part of which is handled by Lacey’s recursion argument. 
We will then iterate these coronas into a double corona. We first recall our basic 
setup. For convenience in notation we will sometimes suppress the dependence on 
a in our nonlinear forms, but will retain it in the operators, Poisson integrals and 
constants. We will assume that the good/bad quasicube machinery of Nazarov, Treil 


and Volberg [35] is in force here as in [30]. Let QD° = QD® be an (r, €)-good 
quasigrid on R", and let ae _ 


quasiHaar bases as described above, so that 


and {ny be corresponding 
JEQD®, bel, 


f= D> Affandg= D> Ags, 


IeQD* JEQD® 


where the quasiHaar projections Aff and Af g vanish if the quasicubes / and J are 
not good. Recall that we must show the bilinear inequality (17), i.e. |7* (f,g)| < 
Mee fll) Iglzey- 

We now proceed for the remainder of this section to follow the development 
in [30], pointing out just the highlights, and referring to [30] for proofs, when no 
changes are required by the inclusion of quasicubes and common point masses. 


The Calder6n-Zygmund Corona 


We now introduce a stopping tree F for the function f € L? (a). Let Fbe acollection 
of Calderén-Zygmund stopping quasicubes for f, and let QD° = LU Cr be the 


FEF 
associated corona decomposition of the dyadic quasigrid QD. See below and also 


[30] for the standard definitions of corona, etc. 

For a quasicube J € QD® let mgpeI be the QD*-parent of J in the quasigrid 
QD*, and let mI be the smallest member of F that contains J. For F, F’ € F, we 
say that F’ is an F-child of F if t¢(2QpoF’) = F (it could be that F = megp°F’), 
and we denote by €z (F) the set of F-children of F. For F € Ff, define the projection 


oO . : is 0,a 
Pe. onto the linear span of the quasiHaar functions {hi \ 1eOae wets by 


Pegs AS Sa. 


I€Cr TéCr, a€l yn 
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The standard properties of these projections are 


sa ores f Pe Ao=0. Ula =O Ios: 


FeF FeF 


The Energy Corona 


We also impose a quasienergy corona decomposition as in [21] and [16]. 


Definition 7.1 Given a quasicube So, define S(So) to be the maximal subqua- 
sicubes J C Sp such that 


prubgood.o 
J 


Pe (7, teh) | 


one \ 2 | 
( T > Cenergy |e) AS age oe ; 
JEM —deep UW) JI " 


Lo) — 

(28) 
where €"°"® is the constant in the strong quasienergy condition defined in Defi- 
nition 2.6, and Cenergy is a sufficiently large positive constant depending only on 
t > r,n and aq. Then define the o-energy stopping quasicubes of Sp to be the 
collection 


S= {8} UJ S 


n=0 
where Sp = S(So) and S,41) = 'o S(S) forn > 0. 
SESy 


From the quasienergy condition in Definition 2.6 we obtain the o-Carleson 
estimate 


Y> Sle <2Ml,, Le QD’. (29) 
SES: SCI 


Finally, we record the reason for introducing quasienergy stopping times. If 


2 
P* (J, 1 : 
X, (an = sup A ~ ( (J, pe) | gies a (30) 


1eCs Io Fehde alll MG Lo) 


is (the square of) the a-stopping quasienergy of the weight pair (o, w) with respect 
to the corona Cs, then we have the stopping quasienergy bounds 


Xy (Cs) = af an (gato? +AS+A5PM™™ SES, (31) 


where AY + AS" and the strong quasienergy constant €"°"S are controlled by 
assumption. 
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General Stopping Data 


It is useful to extend our notion of corona decomposition to more general stopping 
data. Our general definition of stopping data will use a positive constant Co > 4. 


Definition 7.2. Suppose we are given a positive constant Cp > 4, a subset F of the 
dyadic quasigrid QD® (called the stopping times), and a corresponding sequence 
ay = {ar (F)}per Of nonnegative numbers a7 (F) > 0 (called the stopping data). 
Let (F, <, 1) be the tree structure on F inherited from Q2D°, and for each F € F 
denote by Cr = {1 € QD® : mrI = F} the corona associated with F: 


Ce = {1 € QD" :1C Fand/ ¢ F for any F’ < F}. 
We say the triple (Co, F, ax) constitutes stopping data for a function f € L},. (c) 


if 


1. Ef |f| < a (F) for all J € Cp and F € F, 
2. Vepr<rlF’le < ColFl, for all F € F, 


3 pero) IF |, = CG Illia)» 
4. ar (F) < aF(F’) meneven € Fwith F’ CF. 


Definition 7.3 If (Co, F, a+) constitutes (general) stopping data for a function f € 
(o), we refer to the orthogonal decomposition 


f= >. Pete Pe f= >, APF, 


FeF I€Cr 


Loe 


as the (general) corona decomposition of f associated with the stopping times F. 


Property (1) says that a+ (F) bounds the quasiaverages of f in the corona Cr, and 
property (2) says that the quasicubes at the tops of the coronas satisfy a Carleson 
condition relative to the weight o. Note that a standard ‘maximal quasicube’ 
argument extends the Carleson condition in property (2) to the inequality 


» F’|, < Co |A|, for all open sets A C R”. (32) 
F’€F: F’'CA 


Property (3) is the ‘quasi’ orthogonality condition that says the sequence of 
functions {az (F) 1p};¢F is in the vector-valued space L? (€7; 0), and property (4) 
says that the control on stopping data is nondecreasing on the stopping tree F. We 
emphasize that we are not assuming in this definition the stronger property that there 
is C > 1 such that az (F") > Coz (F) whenever F’, F € F with F’ G F. Instead, the 
properties (2) and (3) substitute for this lack. Of course the stronger property does 
hold for the familiar Calderén-Zygmund stopping data determined by the following 
requirements for C > 1, 
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oy If| > CE® |f| whenever F’,F €¢ FwithF’ OF, E7 |f| < CEZ|f| forl € Cp, 


which are themselves sufficiently strong to automatically force properties (2) and 
(3) with a¢ (F) = EF If]. 

We have the following useful consequence of (2) and (3) that says the sequence 
{az (F) 1p}pezx has a ‘quasi’ orthogonal property relative to f with a constant C4 
depending only on Co: 


2 


Yo ax (F) Ie 


FeF 


< Cy llfllie)- (33) 


L(a) 


We will use a construction that permits iteration of general corona decomposi- 
tions. 


Lemma 7.4 Suppose that (Co, F, az) constitutes stopping data for a function f € 
Liye (0), and that for each F € F, (Co, K(F),ax(e)) constitutes stopping data 
for the corona projection P@,,f, where in addition F € K (F). There is a positive 
constant C, depending only on Co, such that if 


K* (F) = {K €K(F) Cp: ax (K) >= aF(F)} 


K=(e* uf}, 
FeF 
ac(r) (K) for K€ K* (F)\ ey 


ee = max {az (F) 0x (F) (F)} for K=F 


forF €f, 


the triple (C,, K, ax) constitutes stopping data for f. We refer to the collection of 
quasicubes K as the iterated stopping times, and to the orthogonal decomposition 
f = dxex Pox f as the iterated corona decomposition of f, where 


Ce = {1€QD:1C Kand! ¢ K' for K’ <x K}. 


Note that in our definition of (C;,K, a) we have ‘discarded’ from K (F) all of 
those K € K (F) that are not in the corona Cr, and also all of those K € K (F) 
for which ax) (K) is strictly less than a-(F). Then the union over F of what 
remains is our new collection of stopping times. We then define stopping data 
atx (K) according to whether or not K € F: if K ¢ F but K &€ Cpr then ax (K) 
equals or) (K), while if K € F, then ax (K) is the larger of ajc) (F) and a7 (K). 
See [30] for a proof. 
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Doubly Iterated Coronas and the NTV Quasicube Size Splitting 


Let 


NIV. = (ee Aa pa. 


Here is a brief schematic diagram of the decompositions, with bounds in 


(TSH. 8) 
Be, fg) + Bag) + Baf.g) + BY G&f.a) 
4 dual NTVy, NTVo 


\ 
Taiagonal (f, g) + Tearbelow ff, g) + Tearabove (f, g) + Taisjoint (f, g) 


| | 0 0 
1 . 

Bé, (f,g) Tibeee Ure) OF Mean 2) 
1 NTVq + EXtone NTVo 


\ 
Boe (f, 8) Ie Bena (f, 8) ay Bh eighbour (f, 8) 


We begin with the NTV quasicube size splitting of the inner product (T¢f, g) 
— and later apply the iterated corona construction to the Calderén—Zygmund corona 
and the energy corona in order to bound the below form Be, (f, g) — that splits the 
pairs of quasicubes (/, J) in a simultaneous quasiHaar decomposition of f and g into 
four groups, namely those pairs that: 


. are below the size diagonal and p-deeply embedded, 
. are above the size diagonal and p-deeply embedded, 
. are disjoint, and 

. are of p-comparable size. 


BRWN re 


More precisely we have 


(ef, = = (tg (AFA). (APs), 


1EQD?, JEQD® 


(HALA. AF)).+ DD (TE (ATA. (AS2)). 


1EQD? , JEQD® 1EQD?, JEQD® 
JEpl Jp>I 
+ Dl (GAIA. (AS)),+ = DE (TE (ATA). (AF8))., 
IEQD®, JEQD? IEQD®, JEQD? 
JINI=6 2 P<) JEM) S2? 


Be, (f.3) +B,5 (f.8) + Bn ff.g) + By &f.8). 
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Lemma 6.1 in the section on NTV preliminaries show that the disjoint and 
comparable forms Bp (f, g) and B , (f, g) are both bounded by the AS + AS", 
quasitesting and quasiweak boundedness property constants. The below and above 
forms are clearly symmetric, so we need only consider the form Be, (f, g), to which 
we turn for the remainder of the proof. For this we need functional energy. 


Definition 7.5 Let §, be the smallest constant in the ‘functional quasienergy’ 
inequality below, holding for all h € L? (c) and all o-Carleson collections F with 
Carleson norm C¢ bounded by a fixed constant C: 


2 
5 Eg TH) Penns 


FEF JEM y.¢)—deep(F) || 


2. 


Salltlliz. G4 
L?() 


Several ingredients now come into play in order to reduce control of the 
below form Be, (f,g) to the functional energy constant F, and the stopping form 


Beal e)3 


1. starting with the doubly iterated corona of Calderén-Zygmund and energy in 
Lemma 7.4 in order to obtain the decomposition into Tuiagonals Tfarbelow>  farabove 
and Taisjoint 

2. continuing with an adaptation of the Intertwining Proposition from [30] to 
include quasicubes and common point masses so as to bound the forms Th, nelow 

and Tz? nelow (f+ g) using the functional energy constant Fy, 


3. and followed by the NTV decomposition into paraproduct, neighbour and 
stopping forms. 


The adaptation of the Intertwining Proposition to include quasicubes and com- 
mon point masses is easy because the measures w and o only ‘see each other’ in the 
proof through the energy Muckenhoupt conditions AS" and AS""""*”, and the 
straightforward details can be found in [29]. Thus we now turn to the difficult task 
of controlling the functional energy constant Fy, by the Muckenhoupt and energy 
side conditions. 


Control of Functional Energy by Energy Modulo A; 
and iia 


Now we atrive at one of our main propositions in the proof of our theorem. We show 
that the functional quasienergy constants $, as in (34) are controlled by AS, Aj" 
and both the strong quasienergy constant €'"°"* defined in Definition 2.6. The 
proof of this fact is further complicated when common point masses are permitted, 
accounting for the inclusion of the punctured Muckenhoupt condition ASP". But 
apart from this difference, the proof here is essentially the same as that in [30], 


where common point masses were prohibited. As a consequence we will refer to 
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[30] in many of the places where the arguments are unchanged. A complete and 
detailed proof can of course be found in [29]. 


Proposition 8.1 We have 


Ba Se + (43 + jas" + qaene and 3 < Egos ia + ya + oe 


To prove this proposition, we fix F as in (34), and set 


1 
2D DL [Peal Sounamiarenn Zan. 69 
FEF JEM (¢,¢)—deep (F) 


where M (r,¢)—deep (F) consists of the maximal r-deeply embedded subquasicubes of 
F, and where 5, ¢(yy) denotes the Dirac unit mass at the point (cy, ¢ (J)) in the upper 
half-space RYT, Here J is a dyadic quasicube with center c,; and side length @ (J). 
For convenience in notation, we denote for any dyadic quasicube J the localized 


* * o . 
projection P good, —shift , given by 
ce J 
1a} — 1} @ 
Poy = P ogoade—shin, me ) Jl: 


_ good,z —shift 
ICS: eck 


We emphasize that the quasicubes J € M@¢)—deep (F) are not necessarily good, 
but that the subquasicubes J’ C J arising in the projection P? , are good. We can 
replace x by x — c inside the projection for any choice of ¢ we wish; the projection 
is unchanged. More generally, 5, denotes a Dirac unit mass at a point q in the upper 
half-space RYT 

We prove the two-weight inequality 


IP Foe S (EM + VAL + VAS + VAI) Ilr) G0 


for all nonnegative f in L? (co), noting that F and f are not related here. Above, P* (-) 
denotes the w-fractional Poisson extension to the upper half-space Re’ 


t 


Pas fi eet 0). 
(2 + lr») 


so that in particular 


2 
Xx 
1 
n 


) 
Pry 


IPPON cert =e dP Fo) (CW). €))? 


FEF JEeMy—deep (F) 


Lo) 
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and so (36) proves the first line in Proposition 8.1 upon inspecting (34). Note also 
that we can equivalently write ||P% (fo)|| pet p) = ||P (fo) | RR) where 


Py (x,t) = {Pv (x, f) is the renormalized Poisson operator. Here we have simply 


shifted the factor + in 7 to [Pe (fo)| instead, and we will do this shifting often 
throughout the proof when it is convenient to do so. 

The characterization of the two-weight inequality for fractional and Poisson 
integrals in [24] was stated in terms of the collection P” of cubes in R” with 
sides parallel to the coordinate axes. It is a routine matter to pullback the Poisson 
inequality under a globally biLipschitz map Q : R” — R", then apply the theorem 
in [24] (as a black box), and then to pushforward the conclusions of the theorems so 
as to extend these characterizations of fractional and Poisson integral inequalities to 
the setting of quasicubes Q € QP" and quasitents Q x [0,£(Q)] Cc Rv with 
Q € QP". Using this extended theorem for the two-weight Poisson inequality, 
we see that inequality (36) requires checking these two inequalities for dyadic 


quasicubes J € QD and quasiboxes 1 = | x [0, £ (J) in the upper half-space Re 


[ vay PY 0) (1? dE (x, 1) = IP Lo Ido) S (Eats)? + AE + AE* + Ap") oD, 
(37) 


i. [Q"(APdoe) < ((E")" + AS + AS +457") [ean 1), (8) 
R" I 


for all dyadic quasicubes I € QD, and where the dual Poisson operator Q* is 
given by 


2 


arn) = f Yd.) . 


te + l=) 


It is important to note that we can choose for QD any fixed dyadic quasigrid, the 
compensating point being that the integrations on the left sides of (37) and (38) are 
taken over the entire spaces Ry and R” respectively. 


Remark 8.2 There is a gap in the proof of the Poisson inequality at the top of 
page 542 in [24]. However, this gap can be fixed as in [33, p. 861]. 


Poisson Testing 


We now turn to proving the Poisson testing conditions (37) and (38). The same 
testing conditions have been considered in [28] but in the setting of no common 
point masses, and the proofs there carry over to the situation here, but careful 
attention must now be paid to the possibility of common point masses. In [6] 
Hyto6nen circumvented this difficulty by introducing a Poisson operator ‘with holes’, 
which was then analyzed using shifted dyadic grids, but part of his argument was 
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heavily dependent on the dimension being n = 1, and the extension of this argument 
to higher dimensions is feasible (see earlier versions of this paper on the arXiv), 
but technically very involved. We circumvent the difficulty of permitting common 
point masses here instead by using the energy Muckenhoupt constants A" and 
AS* ny which require control by the punctured Muckenhoupt constants AS?" 
and ASP". The following elementary Poisson inequalities (see e.g. [35]) will be 


used extensively. 


Lemma 8.3 Suppose that J,K,I are quasicubes in IR", and that is a positive 
measure supported in R" \ I. If J C K C 2K CI, then 


Pe.) — P°(K,m) — PX. W) 


af 1 1 
JI" |K|" Jl" 


while if2] C K CI, then 


Pe(Kew) — POU) 


1 T 
|K| n |J| n 
Now we record the bounded overlap of the projections P?% ;. 


Lemma 8.4 Suppose PRy is as above and fix any In € QD, so that Ip, F and J all 
lie in a common quasigrid. If J € M(r.c)—deep (F) for some F € F with F 2 In D J 
and P? , 4 0, then 


F= Ip for some O0<f<t. 
As a consequence we have the bounded overlap, 
# {F EeF:Jch g F for some J € M¢,¢)—deep (F) with PE, # o} <T. 


Finally we record the only places in the proof where the refined quasienergy 
conditions are used. This lemma will be used in bounding both of the local Poisson 
testing conditions. Recall that AQD consists of all alternate (2.D-dyadic quasicubes 
where K is alternate dyadic if it is a union of 2” QD-dyadic quasicubes K’ with 
£(K’) = se (K). See [30] for a proof when common point masses are prohibited, 
and the presence of common point masses here requires no change. 


Remark 8.5 The following lemma is another of the key results on the way to the 
proof of our theorem, and is an analogue of the corresponding lemma from [28], but 
with the right hand side involving only the plugged energy constants and the energy 
Muckenhoupt constants. 
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Lemma 8.6 Let QD, F C QD be quasigrids and {Pz ,t FEF be as above 
, JEM (¢.c)—dee (F) 
with J, F in the dyadic quasigrid QD. For any alternate quasicube id € AQD define 


Px (7,1 
BI) = > 3 (meer) [PEI Joc - 


FeF: F2QY for some V €€(1) JEM (,¢)—deep (F): JCI [J] 
(39) 
Then 


B(1) - t (ey + Ayer) Zo (40) 


The Forward Poisson Testing Inequality 


Fix J € QD. We split the integration on the left side of (37) into a local and global 
piece: 


i. ae (jo)? du = [ee (ljo)* dj + —_ 7 P* (1,0)? djt = Local (J) + Global (/) , 
R" n 
+ 


where more explicitly, 


1 
=5h, I) 


ile. = y 2 nae = > | Pe axle) . 8 (cy) 


JEQD FEF JEM (¢,¢)—deep F) 


ik aia i 


Here is a brief schematic diagram of the decompositions, with bounds in ;, used in 
this subsection: 


Local (/) 


i 
Local’? (7) + Local"* (7) 
2 


| 
A + B 


(egrn8) 2 (Esttons)? ee AS ,energy 
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and 
Global (J) 
a 
A + B + °C + D 
Ag Ag Oe eas a aa Aye Aces 


An important consequence of the fact that J and J lie in the same quasigrid 
QD = QD", is that 


(c(J),£()) € Tif and only if J C 1. (42) 


We thus have 


Local (J) = / P* (10) (x,t)? dj (x, t) 
I 

2 

om x 

Pre 1 

[J|" 


= YL Pao) (culty 


FEF JEMy—deep(F): JCI 


00 eee 
re S- » P* (VJ, 10) IPRs Tlie 


FEF JE My—deep(F): JCI lJ| 


Lo) 


< Local"? (J) + Local” (J) , 


where the ‘plugged’ local sum Local?" (7) is given by 


Local’"® (1) = py Py (Pete) Pr Xe 


FEF JEMy—deep(F): JCI |J|" 


P? (J, 1 
a Sh ek ~ (Pete) IP2 xl 


FEF: FCI FEF: FQI) J€Mr—deop(F): JCI J | 
=A+B. 


Then a trivial application of the deep quasienergy condition (where ‘trivial’ means 
that the outer decomposition is just a single quasicube) gives 


P* (J,1 
bs » (mutenn) | Pr eee 


FEF: FCI JEMy—deep (F) \J| : 


< (ects)? FI, < S (Esrons)? I, ; 


FeF: FCI 
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2 
< | Be ox | a where we recall that the quasienergy constant 
L*(@) 


2 
4 a} 
since | Pr ll) < 


Eston’ igs defined in Definition 2.6. We also used that the stopping quasicubes F 
satisfy a o-Carleson measure estimate, 


>) Fle SlFole 


FEF: FCFo 


Lemma 8.6 applies to the remaining term B to obtain the bound 
B < t (Gao + Agere’) IT, 


It remains then to show the inequality with ‘holes’, where the support of o is 
restricted to the complement of the quasicube F. Thus for J € M(r,¢)—deep (F') we 
may use / \ F in the argument of the Poisson integral. We consider 


P* (J, Inpro 
Local’ (1) = > >. (Fear) |Pr Eee 
FEF JEM (e.¢)—deop(F): JCI |J|” 
Lemma 8.7 We have 
Local" (1) < (E™2)" [7], - (43) 


Details are left to the reader,or see [30] or [29] for a proof. This completes the 
proof of 


(44) 


P* (J, 1 
Local (L) ~ Py > (Mujer) | PIX 7246) 


FEF JEM r.¢)—deep(F): ICL [al 


S (er)? + age") [L|,,, Le QD. 


The Alternate Local Estimate 


For future use, we prove a strengthening of the local estimate Local (L) to alternate 
quasicubes M € AQD. 
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Lemma 8.8 With notation as above and M € AQD an alternate quasicube, we 
have 


(45) 


Local (M) = ) > oz (Pew) Pe Allirw 


FEF JEM (e.6)—deep(F): JCM lal 


< (a: + A?) \M|,, Me AQD. 


Again details are left to the reader, or see [30] or [29] for a proof. 


The Global Estimate 


Now we turn to proving the following estimate for the global part of the first testing 
condition (37): 


Global (/) = i ie P* (1jo)* dit S AS |I\, 
+ 


We begin by decomposing the integral on the right into four pieces. As a particular 
consequence of Lemma 8.4, we note that given J, there are at most a fixed number 
t of F € F such that J € My—deep (F). We have: 


2 


x 
fn Barus LY Kaeo YD [Pes 
wth ae lal" Le 
+ Ji (ey LINER NF TEMS ep(F) Lo) 
2 
x x x x x 
= an + + +P Ao)(cneMOyr Ye —_ 
JO3I= JON JIN= JQI FEF: ly|" L(w) 
tuyeeo) isu JEM (rs) —deep (F) 
=A+B+CH+D. 


Terms A, B and C are handled almost the same as in [30], and we leave them for 
the reader. As always complete details are in [29]. 

Finally, we turn to term D which is significantly different due to the presence of 
common point masses, more precisely a new ‘preparation to puncture’ argument 
arises which is explained in detail below. The quasicubes J occurring here are 


included in the set of ancestors A, = | of lL l<k<o. 
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2 
[oe] 
1\2 x 
D = >> P* (10) (c (Ad) . |Aa| ') > —— 
k=1 FEF: |Ax|* Dw) 
AcE M (r,)—deep (F) 
[oe] 1\2 2 
= Po) (ca). 1Al®) Ps Ay 7" , 

a MEM teep(®) TeCpoer CAI NP (@) 

oe) 2 2 

i x 

+P (yo) (cy Al") y ver 

= ; ood, — shi KI" Wyo 

= Ake Mind aeep(P) ECP nhs cy Lo) 

co 12 2 
+ OP" (io) (cy). MAl") > ee 7" : 

= Mee aa) Hepat Mts 1S ICA KI" Wi2(o) 

= Daisjoint So Deescendent + Dancestor- 
We thus have from Lemma 8.4 again, 
CO : 2 
Daisoint = >_ P* (10) (c (Ax). [Acl") 
k=1 
2 
x 
(a) 
- 2 a as 
FeF: 7 - pg00d,t —shift, 7, kl” 2) 
Ak€M (r,2)—deep (F) a Cp CAM 


co co 1—2 
Mla Lal ul ia 
sy (Hee : TAK \ My =) ie Daas [Ak Flog Mo 


lea _ 
er Ve Ws pat Ax (20 
I 
< es PH, ino) Mle S TAS" Mle 
since 
| ln 
MeV tle = f Lar () doo (x) 
fan [ARIPO tat (Ag) 
ioe) 1-4 
1 ol 
ip 2 a0 Fe pas ay 40) 
i n—-a 
Tln 
<| ; lI ; do (x) = P* (I, 1pew). 
I [i + quasidist (x, D| 


The next term Dgescendent Satisfies 
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[o-e) [Z| |A | 1,2 2 
y k\" d, x 
D descendent Ss (er) t peo O) : 


j=1 ZEIT" 12) 


2 2 
=T ae veo (He IT\, psood.o X 
an 1-2 I 1 
|| [Z| Pw) 
2 
good x _ 
‘uae 


20-8 


[Z| 


oO 


L-(o) 


eA 
ea 


Mle S rA5™ IU, 


ow 


Finally for Dancestor We note that each J’ is of the form J’ = Ay = ee for 
some £ > 1, and that there are at most Ct pairs (F,A,) with k > & such that 


Ax € Mee)—deep (F) and J’ = Ay € Gc tshift Now we write 


2 


o 


"VAl* 


co 
Dancestor = SF (1,0) (c (Ax) ‘ |Ax| ) > ‘. 
k=1 


good, —shift , 
€Cr IGS CAk 


FeF: y L(w) 
AcE M (r)—deep (F) 


By Ree » 2 
He |Axl” o _* 
Se) [+ a ANTE 
k=1 \IAg (=1 KI" W12(w) 
a ee 2 
Tle |Ax|" gooda X 
<T > (Ae Ak Ale : 
k=1 \IAkl "7 cl” | 22() 


It is at this point that we must invoke a new ‘prepare to puncture’ argument. Now 
define @ = w — w ({p}) 6, where p is an atomic point in J for which 


o(ip})= sup (tq}). 


IEBo,0)' qel 


(If w has no atomic point in common with o in J set @ = w.) Then we have |/|5 = 
(I, Bio.) and 


x Wl, o (I, Bow) |, < Am Punt 


= 
1O-®) 0) 
A key observation, already noted in the proof of Lemma 3.2 above, is that 


[Ag @-P[nq) if re 


[Ax EXll72, = 2 
- || Agx pe if _ 


<(K)|Klz, forall K € QD, 


(46) 
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and so, as in the proof of Lemma 3.2, 


2 
good, x 
Ak 1 
|Ax|” L(a) 


< 3/Als. 


Then we continue with 


1\2 2 
= [I oO |Ax| " psood.o x 
¥ » Ad Ak 


[Axl 


k=1 L?(@) 
2 2 
IT|, |Ax| 
: “>( oily asta 
jai \ Ag 


II 
¥ 


0 ia. cs ul : 
& ax 
—~7 } Ac \Jln +7 ——_—_ ]} ls 
= (Hee | 7 2 Qkn—c) weg ic 


< 7 Cy + Ager) IZ ; 


2 
where the inequality }°7° , ( We ) |Ax \ I < AS* ||, is already proved above 


[Al 
in the estimate for Dgisjoint- 


The Backward Poisson Testing Inequality 


Fix J € QD. It suffices to prove 


Back (7) = is [Q* (Gz) of} do(y) < {4s ao (ern + a) [age [ean t). (47) 


Note that in dimension n = 1, Hyténen obtained in [6] the simpler bound A> for the 
term analogous to (47). Here is a brief schematic diagram of the decompositions, 
with bounds in, used in this subsection: 


Back (7) 
+ 
Us 
+ 
cppreximal + yremote 
Art 
(em rm a=) [yao } 
1 
Tiiference + intersection 
s $ 
Ait 


2 f / ,,punct 
(ene —) [ acrpamet (a+ ape) AS pun 
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Using (42) we see that the integral on the right hand side of (47) is 


feam=T IPR. (48) 


FeF JeM, (re) —deep (F): JCI 


where P¢? , was defined earlier. 
We now compute using (42) again that 


2 
a" (a7) 0) = [ ee dit (1) (49) 
"(2 + jx-yl’) 2 


>? 


a n+l-—a’ 
FEF JEM c)ccop(F) (WI tt ly— cil) 
JcI 


Pe 


lps 


and then expand the square and integrate to obtain that the term Back (7) is 


2 


2 a) 
| PRIX 00) | Pra Lo) d 
Ss » Re 1 n+1—a 1 n+1l—a a (y). 
FeF F'EF: an - ) ( ‘G = ) 
rete yen Foe” (Wt + bel |" + yer 
JCI 


J’cl 


By symmetry we may assume that £ (J’) < £ (J). We fix an integer s, and consider 
those quasicubes J and J’ with ¢ (J’) = 27~*€ (J). For fixed s we will control the 
expression 


Zp? > 


F\F'EF JEM (r.2)—deep (F), JEM (r,e)—deep (F”) 
IS CL, LS )=2- 4) 


2 


Lo) 


(a) 
[PEL [Pros 
a a 1 +1—a 1 n+1—a do (y) : 
(ul +bp—el)” (ut + yer) 


by proving that 


1 
—éds a strong a,ener; a@,punct _ 
U, <2 ee + (« + A =) pare [ean where 6 = an 


(50) 


With this accomplished, we can sum in s > 0 to control the term Back (7). The 
remaining details of the proof are very similar to the corresponding arguments in 
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[30], with the only exception being the repeated use of the ‘prepare to puncture’ 
argument above whenever the measures o and w can ‘see each other’ in an estimate. 
We refer the reader to [29] for complete details.° 


The Stopping Form 


This section is virtually unchanged from the corresponding section in [30], so we 
content ourselves with a brief recollection. In the one-dimensional setting of the 
Hilbert transform, Hyt6nen [6] observed that “...the innovative verification of the 
local estimate by Lacey [8] is already set up in such a way that it is ready for us to 
borrow as a black box.” The same observation carried over in spirit regarding the 
adaptation of Lacey’s recursion and stopping time to proving the local estimate in 
[30]. However, that adaptation involved the splitting of the stopping form into two 
sublinear forms, the first handled by methods in [16], and the second by the methods 
in [8]. The arguments are little changed when including common point masses, and 
we leave them for the reader (or see [29] for the proofs written out in detail). 


Energy Dispersed Measures 


In this final section we prove that the energy side conditions in our main theorem 
hold if both measures are appropriately energy dispersed. We begin with the 
definitions of energy dispersed and reversal of energy. 


Energy Dispersed Measures and Reversal of Energy 


Let jz be a locally finite positive Borel measure on R”. Recall that for 0 < k < n, we 
denote by £{ the collection of all k-dimensional planes in R”, and for a quasicube 
J, we define the k-dimensional second moment Mj (J, 4) of fz on J by 


Mi (J, 4)? = inf i dist (x, L)? du (x) . 
J 


Lely 
Finally we defined jz to be k-energy dispersed if there is c > 0 such that 


Mr (J, 2) = cM5 (VJ, 4), — for all quasicubes J in R”. 


°In [28] and [30] the bound for term B in the global estimate was mistakenly claimed without proof 


to be simply A} instead of the correct bound A$ + (a + (ar) Ae given in [29]. 


Two Weight Boundedness 361 


In order to introduce a useful reformulation of the k-dimensional second moment, 
we will use the observation that minimizing k-planes L pass through the center 
of mass. More precisely, for any k-plane L € Li such that /’, dist (x, L) du (x) 
is minimized, where A is a set of positive jz-measure, we claim that 


Wes EL. 
Indeed, if we rotate coordinates so that L = {(x', ne at j@") : (i, me x) 
E R*}, then 
a ; \2 
; 2 = j_ qi 
[aisve L)* du (x) Joos (x ) dt (x) 
le \2 ; ; wy) 
= j 2d | x di 
ZLO) du (x) — 2 [ Han + ( ) [veo 
- > o: ( ) ai)’ —2laPae®) 
2 I (x)'aneo + (f anes) )(d) -2 frau) 
is minimized over a‘*+!,..., a” when 


# i Jed (x) = ( = fb y 


an PRET Ss (py kt1l<j<n. 
J, dua) a 


This shows that the point E/\x belongs to the k-plane L. 

Now we can obtain our reformulation of the k-dimensional second moment. Let 
S; denote the collection of k-dimenional subspaces in R”. If Ps denotes orthogonal 
projection onto the subspace S € S/_, where § = Es and Lo € Sj is the subspace 
parallel to L, then we have the variance identity, 


Mi (J, 4)? = inf / dist (x, L)? du (x) = _ inf / |Psx — Ps (Efx) |? du (x) (51) 
Lely J SES! 5 


1 
— inf 7 
2 ses"_, = ff ips Psy? dt (x) du (y) 


g ee Wn ~ [ [as (x, Lo + y)* du (x) du (y), 


since Ps ( a x) = (Psx). Here we have used in the first line the fact that the 
minimizing k-planes L pass through the center of mass 7 x of x in J. 

Note that if ~2 is supported on a k-dimensional plane L in R”, then Mi (J, jz) 
vanishes for all quasicubes J. On the other hand, Mj (J, 2) is positive for any 
quasicube J on which the restriction of 44 is not a point mass, and we conclude 
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that measures jz supported on a k-plane. and whose restriction to J is not a point 
mass, are not k-energy dispersed. Thus Mj (J, 2) measures the extent to which a 
certain ‘energy’ of jz is not localized to a k-plane. In this final section we will prove 
the necessity of the energy conditions for boundedness of the vector Riesz transform 
R°” when the locally finite Borel measures o and w on R” are k-energy dispersed 
with 


n-k<a<n,a4¢n-—1 if l1<k<n-2 (52) 
O<a<n,a4¢l,n—-1 if k=n-1 


Now we recall the definition of strong energy reversal from [25]. We say that a 
vector T® = { Te of w-fractional transforms in the plane has strong reversal of 
q@-energy on a cube J if there is a positive constant Cp such that for all 2 < y < 


2"(—®) and for all positive measures ju supported outside yJ, we have the inequality 


2 
nw) nw \2 P* (VJ, L) cv zo |qO pdorpa |? 
uy [(«- a7 X) | (=e = E(VJ,w)P Gay = Co EP |T* pn — Ed T L| : 

(53) 

Now note that if w is k-energy dispersed, then we have 

2 1 n 2. 1 n 2 2 
E(J,@)° = ——zM) Vo) S$ — 3M, V0) = Ex. Vo)", 
Mle WI" lo WI" 


and where we have defined on the right hand side the analogous notion of energy 
E, (J, @) in terms of M; (J, @), and which is smaller than E (J, w). We now state the 
main result of this first subsection. 


Lemma 10.1 Let 0 < a < n. Suppose that w is k-energy dispersed and that k 
and a satisfy (52). Then the a-fractional Riesz transform R®”" = sae an has 
strong reversal (53) of w-energy on all cubes J provided y is chosen large enough 


depending only on n and a. 


In [27] we showed that energy reversal can fail spectacularly for measures in 
general, but left open the possibility of reversing at least one direction in the energy 
for R°” when a # 1 in the plane n = 2, and we will show in the next subsection that 
this is indeed possible, with even more directions included in higher dimensions. 


Fractional Riesz Transforms and Semi-harmonicity 


Now we fix 1 < @ < nand write x = (x,x”") with xX = (4,...,x¢) € R° and 
x” = (xe41,..-.Xn) € IR" (when £ = n we have x = x’). Then we compute for B 
real that 
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B B 
Ay |x/2 = Ay (|x? + x") eV (|x? + x") * 


B_ Bis 
f (|2’? “hs x"!’) z rae re (|x? i x") 2 | 


=2 


ba bo 
(Vex) (XP + fe?) 7 42 Ve (le P + be" P) 7 


= Vy . = BV . 


=p 


p-2 
= By e( (P+ RP) oe P= (WP +h)? 24 


B-4 ae 


CHP +E) (WE+ WF)? +O-abF (WR + Wr) | 


= 


‘ (|x? + vp)? +6- 2) |x’? (wwe)? 


= pI(0+B—2|x'|> + e|x"| (vee bP) > 


The case of interest for us is when 6B = a —n + 1, since then 
Rae [a|P S Vae « Veg x SV lS cape Ky 64) 


where K®”” is the vector convolution kernel of the w-fractional Riesz transform R®’”. 
Now if £ = 1 in this case, then the factor 


Fog (x) = (C+ B—2) |x| +e |x"? 


is (6 — 1) |x’? + |x”|°, and thus in dimension n > 2, the factor Fp (x) will be of 
one sign for all x if and only ifa—n+1= 6 > 1, i.e. a > n, which is of no use 
since the Riesz transform R®” is defined only for 0 < a <n. 

Thus we must assume £ > 2 and B = a—n-+ 1 whenn > 2. Under these 
assumptions, we then note that Fz (x) will be of one sign for all x if £-+ B —2 > 0, 
ie.a@ >n-+1-— 2, in which case we conclude that 


Ay perm] = lant {(e+a—n— de? + ele"? t (|x? + 2" 5 ae 55) 
a—n—1 
~ (|: + x") * ss |x[*" 1, fora #n—1. 
When £ = n, this shows that JAx pe ~ |x|"! fora > | witha £n—1. 


But in the case £ = n we can obtain more. Indeed, since x” is no longer present, we 
have for 0 < a < | that 
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Bs eee ~ ae ; 


(This includes dimension n = | but only for 0 < a < 1). 
We summarize these results as follows. For dimension n > 2 and x = (x’, x") 
with x € Ré and x” € R’~*, we have 


Av a ~ ner 


provided 


either 2 < £<n—landn+1—£<a<nwitha#n-1, (56) 


or £=nand0 <a <nwitha £1,n-1. 


Thus the two cases not included are a = 1 anda = n— 1. The case a = 1 is not 
included since |x|*-"*! = |x|? is the fundamental solution of the Laplacian for 
n > 2 and constant for n = 2. The case a = n—1 is not included since |x|*~"T! = 1 
is constant. 

So we now suppose that w and & are as in (56), and we consider £-planes L 
intersecting the cube J. Recall that the trace of a matrix is invariant under rotations. 
Thus for each such £-plane L, and for z € JNL, we have from (54) and (55), and with 


Pet!" (2) = fon |Z —yl*T'" du (y) denoting the convolution of |x|°T'™ with j, 
that 
a—n— P* GV, 
|V_R%"w (z)| = |traceV,R" yt (z)| = [ATT "pw (2)| © / ly —2|7-" | du (y) & ar 
(57) 


where V;, denotes the gradient in the £-plane L, ic. Vr = PsV where S is the 
subspace parallel to L and Ps is orthogonal projection onto S, and where we assume 
that the positive measure jz is supported outside the expanded cube yJ. 

We now claim that for every z € JM L, the full matrix gradient VR®” py (z) is 
‘missing’ at most £ — 1 ‘large’ directions, i.e. has at least n — £ + 1 eigenvalues 


each of size at least ch 4 # ) Indeed, to see this, suppose instead that the matrix 
ly] i 


VR" (z) has at most n — £ eigenvalues of size at least cP Uw pe) 


ly" 


. Then there is an 


£-dimensional subspace S such that 


P* (J, WL) 
——, 


J|* 


|VsR°" yu (2)| = (PsV) RO" (2)| = [Ps (VR (2))| S € 


which contradicts (57) if c is chosen small enough. This proves our claim, and 
moreover, it satisfies the quantitative quadratic estimate 


P* (VJ, L) 
MG 


|f- VR°" 2 @ E| = e—— EP, 
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for all vectors € in some (n — £ + 1)-dimensional subspace 


n—l+1 1 n—l+1 1 
Ss! Span {v.,...,V! YES ous, 


with w € S""! forl <j<n—-£+4+1. 
It is convenient at this point to let 


k=€-1, 
so that 1 < k < n—1 and the assumptions (56) become 
either 1 <k <n—2andn—k<a<nwitha#4n-1, (58) 
ork=n—1and0<a <nwitha 4 1,n—-1, 


and our conclusion becomes 


If VRE" (2) | > 


lg?, €eSh* zew. (59) 
|J|* 


Proof of Strong Reversal of Energy 


We are now in a position to prove the strong reversal of energy for Riesz transforms 
in Lemma 10.1. 


2 
Proof (of Lemma 10.1) Recall that Ey (J,@)? = infrecr qe f, (4) do (x) 
o Jin 


and 
1 di L : 1 1 1 di L ° 
t U(x, 
[( 2) eo=37 [Ff (EE) eowe. 

lo Jr ae 2 No Ja lo Jy J] 
(60) 

where we recall that Lo € Sj is parallel to L. The real matrix 
M (x) = VR“), 3=xeJ, (61) 


is a scalar multiple of the Hessian of ef? hence is symmetric, and so we can 
rotate coordinates to diagonalize the matrix, 


dG) 0 = 0 
M(@ = 0 Ar(x) 
. 0 
0 +++ 0 An (x) 
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where |A, (x)| < |Az2 (x)| < ... < [An (&)|. We now fix x = c; to be the center of J 
in the matrix M (c;) and fix the eigenvalues corresponding to M (c,): 


Ai] S|Ag] S---S|Anl, APSA), 


and define also the subspaces S”~' to be i for 1 <i < k. Note that we then have 
S"! = Span {e;41,...,€n}. Let L! be the i-plane 


Le=z+ (s"-)* a Cee sareees og) : eee | € R'}. (62) 
By (59) we have 


mG) 


1 


Arti] =e 


J\* 


For convenience define |A9| = 0 and then define 0 < m < k be the unique integer 
such that 


PY (J, L) 
1 
|J|* 


Now consider the largest 0 < £ < m that satisfies 


lAm| <¢ aoa beeen (63) 


Ael < y7™ [Actil- (64) 


Note that this use of £ is quite different than that used in (56). 
So suppose first that £ satisfies 1 < € < mand is the largest index satisfying (64). 
Then if £ < m we have |A;| > yom |Ai+1| for £ + 1 <i <_™m, and so both 


_m-e P* (VJ, pW) 
c——— 


at _ mt 
|Acuil > yy ™ Agpa|>...>¥° ™ Ample = ni , (65) 
Jia 


ad 
JAils...S Ad sy lAcsil. 


Both inequalities in the display above also hold for £ = m by (63) and (64). Roughly 
speaking, in this case where 1 < £ < m, the gradient of R®” uz has modulus at least 
|Ac+1| in the directions of e¢+;,...,€,, while the gradient of R°”w has modulus at 
most y~ 2" |A¢+1| in the directions of e1,..., e¢. 

Recall that S’-& = =i is the subspace on which the symmetric matrix 
M (cy) = V (R*" 2) (cz) has energy €"M (cy) € bounded below by |A¢+1|. Now 
we proceed to show that 


2 1 
Penh MFEU.0) ST | / IR" (x) — Rp (2) 2 doo (x) deo (2). (66) 
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We will use our hypothesis that w is k-energy dispersed to obtain 
EV,w) <ExV,@) < En VJU,o) < Ee J, o) 
since £ < m < k. To prove (66), we take L, = Bs as in (62) and begin with 


2 
dist (x, L,)* = dist (x. z+ (s*)") (67) 


2 


’ 


= (xe41 — 041) +--+ On — Zn) = |x” =" 


where x = (x’,x”) with xX € R‘ andx’ eR’, and L, = {(u’,z"): uw’ € R}. Now 
for x,z € J we take §& = (0. =x) e grt (where ° = 0). We use the estimate 


du (y) 1 du (y) 1 P* VJ, ) 

£ [VR ulsooey $ bl ) : | 

"| “lly Qn S| |" R"\ ys ly —e,|"° +? ~y R'\yJ ly —e,|" et} y la} 
(68) 

to obtain 
i 2pa,n 1\? 
eI, (IR lemey be a1) do) deo (69) 
JJJ 


2 
dw (x) dw (z) = om (J, E(VJ,w). 


SoU. we Lt 


We then start with a decomposition into big B and small S pieces, 


—3Z| 
ll" 


1 
VE / / [R°" pW (x) — R®" pW (2)|? dw (x) dw (z) 
2 ef [mere ex) Ren 2) do doe) 


sf [Ren (x’,x”) — R°"w (2.x) |? dw (x) dw (z) 
® IIIT 

=B-S. 
For w € J we have 


|VR°"W (w) — M (c))| = |VR°" yu (w) — VR" (c))| (70) 


1 P* (J, u) 


itl pou & T°? 


S jw PR 
yo slr 
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from (68), and this inequality will allow us to replace x or z by c, at appropriate 
places in the estimates below, introducing a harmless error. We now use the second 
inequality in (65) with the diagonal form of M (c;) = VR®"u (c;), along with the 
error estimates (69) and (70), to control S by 


S< sell x —Z)-V/R°"w (x)? do (x) dw (z) 


an 2 
+o ff {Ive Heo bk’ — Z| \ doo (x) do (2) 


S oe ddl x —2z/) VIR (cd? dw (x) dw (z) 


tre). | {| VR" ogy be — Z| MIF} deo (a) deo (2), 
JL art 
and then continuing with 


<p Tank x fz [Ael} do (8) des (2) + GP" VJ, w)° E(J,@) 


ax 


1 1 1 
— lesley ff bea deo) doe) + SPU. WE Voy 
y J Jy da y 
1.2 1 
a - Vl" eu EV, @) + ol VJ, np) EV,o), 


which is small enough to be absorbed later on in the proof. To bound term B from 
below we use (70) in 


Ro u(Z, / x") - Ro u(z,2) = (x” = z’) : VR " L (2) +O (| VPR™ || 00) |x _ dl’) 
= (x” -_ 2’) : VR L (cy) +O (| V?R™ || 00, |x —z| i") , 


A 


and then (59) with the choice € = (0 ir=m) € € S’~*, to obtain 


> Tx 
x” =2"[ egal s |e 2" |(E-V") ROW (cr) €| 


= |(x” -” z’) VR" 1 (cz) + é| 
< I(x” _ z’) VRE (cy)| 


< |R7" (Z,x”) —R™ uw (z a z") )|+0(| vere. 


i 
| oou) ZI ult). 
(/) 
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Then using (69) and (70) we continue with 


aed, — Ry (2',2”) |’ de (x) do (2) 
aera are <2" P des (9) do @) = SPU WP Eo) 
and then 
Aegi? is E (Vo)? <Clacil WI" Ec Vo) (71) 
= |Ac4+i| ae fae Es dw (x) dw (z) = esil? ake Pale dw (x) da (z) 


So ve oo x”) — Rp (2,2)? doo (x) dw (2) + ae J. wy EW) 


2A 


ars an _ Ran 2 a5 cl 2 2 
se fie Et (x) — R*" py (2)| do day SF (VJ, u) EV, a) 


2A 


1 1 2 
ae Jf iwetne) — Re WEP deo 0) do (0) + * aeiP MEG? 
o JdJdJ 


since ae (VJ, uw) EV,@) < t MG |Ac+i|? E(J,) for y large enough depend- 
ing only on n and q. Finally then, for y large enough depending only on n and a 
we can absorb the last term on the right hand side of (71) into the left hand side to 
obtain (66): 


Asal? VIE EG,0) $ — ral i IR" (x) — R°" (2)? do (x) deo (2). 


"i _m—t pe 
But since y~ "a cP UY 


< |A¢+1| by (65), we have obtained 
ly|n 


PY, w)7E U0)? < Gy Ren? Ml? EU.) 
1 
a i i Re" (x) — Rp (2)[? do (x) deo (2), 


which is the strong reverse energy inequality for J since 


an an (a) an yaw an 2 
ae sae | [IR Ren OP deo) do @ = BF [RO EPR |”, 


This completes the proof of strong reversal of energy under the assumption that 
1<f<m. 
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If instead £ = 0, then |A;| > yom |A:+1| for all 1 < i < m, and so the smallest 
eigenvalue satisfies 


a_i 2 _k _1 P* UV, ph) 
[Ail > y7 = [dol > pT ® [As] Se SE el SY ae 
Jin 
In this case the arguments above show that 
P* (7, u)\~ 1 
(ee EU 0 Se f [iH 0) RU OP do edo 
n w J J 


1 
+ Zz VJ,py EV,o), 


which again yields the strong reverse energy inequality for J since the second term 
on the right hand side can then be absorbed into the left hand side for y sufficiently 
large depending only on n anda. Oo 


Necessity of the Energy Conditions 


Now we demonstrate in a standard way the necessity of the energy conditions for the 
vector Riesz transform R®” when the measures o and w are appropriately energy 
dispersed. Indeed, we can then establish the inequality 


EOE 1 AS Eoin, 


So assume that (58) holds. We use Lemma 10.1 to obtain that the w-fractional Riesz 
transform R®” has strong reversal of w-energy on all quasicubes J. Then we use the 
next lemma to obtain the energy condition EX""® < Tane + ,/AF. 


Lemma 10.2 Let 0 < a < nand suppose that R“" has strong reversal of w-energy 
on all quasicubes J. Then we have the energy condition inequality, 


strong / ,&,punct 
ce Ss Lana + A, . 


Proof Fix y => 2 large enough depending only on n and a, and fix goodness 
parameters r and ¢ so that y < 2"!—®), Then Lemma 10.1 holds. From the strong 
reversal of w-energy with du = 1;,\,,;do, we have 


E(V,@)° P* (J, 1;,\yyd0)° 


< CE} |T (1,\,do) — By? T (1),\y,d0)|? 


2 


> 


ax 


2? |T* (1),\).da) |” S EY |T* (Ay,do)|? + E® |T* (1,0) 
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and so 


YW, EU. oy Pty? S > he (11,do) (x) |’ do w+ fir (1yydo>) (x)|" do (x) 
gous qe oe 


JEM (x,2)—deep (Ir) 


< / |T* (1),do) Cfrdow +> | |T* (1,2do) (x)|” deo (x) 
IL, 7 yt 


S Te Wrle + DS Toe lye S Tae Urlo 
J 


since yJ C I, for y < 2"'-®), and since the quasicubes yJ have bounded overlap 
(see [29, Lemma 2 in v3]). We also have 


- lo EU. 0)? PY (J,1ysdo)” < yO Aa iiss 
JEM (x 2)—deep (,) JEM (x, 2)—deep (,) 


by the bounded overlap of the quasicubes yJ in J, once more. We can now easily 


oO 


a,punct 
2 7 


complete the proof of E2""* < Tyna + 4/A 


Acknowledgements We thank the referee for a careful reading of the paper and numerous 
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A Partition Function Connected 
with the Gollnitz—Gordon Identities 


Nicolas Allen Smoot 


Abstract We use the celebrated circle method of Hardy and Ramanujan to develop 
convergent formule for counting a restricted class of partitions that arise from the 
Gollnitz—Gordon identities. 


Introduction 


The purpose of this article is to illustrate a beautiful application of the tools of 
complex analysis to a discrete subject: the theory of addition over the integers, also 
known as partition theory. 

A partition of a positive integer 1 is simply an expression of n as a sum of other 
positive integers. For example, taking the number 5, we find 7 different partitions: 
5,44 1,34+2,34+141,24+24+1,24+14+1+1and14+1+1+4+141. 

The number of partitions of n is denoted by p(n), and is often called the partition 
function. In our example, we have p(5) = 7. 

While partitions have been studied since the time of Euler [6], very little was 
known about the partition function itself before the twentieth century. Indeed, at the 
end of the nineteenth century, attempts to study the behavior of the prime counting 
function [14] had led to a general sense of pessimism in number theory [5]; it was 
expected that any careful analysis of p(n) would produce an asymptotic formula that 
was approximate at best, and certainly not useful for direct computation. 

It was not until 1918 that Hardy and Ramanujan developed the techniques to 
conduct a detailed study of p(n) [5]. The results of their work were astonishing: not 
only were they capable of achieving a formula that could give the exact value of 
p(n) with relative efficiency, but the formula itself is an utterly bizarre object, as an 
infinite series containing Bessel functions, coprime sums over roots of unity, and 
mz—analytic entities that seem wholly irrelevant to the question of simple addition 
over the natural numbers. 
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The techniques that Hardy and Ramanujan had developed are embodied in what 
is now known as the circle method. This method has since become one of the most 
basic tools in analytic number theory [11, 15]. 

Notably, the circle method has continued to contribute to the theory of partitions 
and q-series. Hardy and Ramanujan’s formula was carefully refined by Rademacher, 
first in 1936 [9] to make their formula for p(n) convergent, and again in 1943 
[10] as an adjustment of the method itself. Soon thereafter, it was realized that the 
techniques embodying the circle method could be used to develop formule for a 
variety of more restricted partition functions (two notable examples are [8] and [7]). 

We are interested here in one such partition function, associated with the 
Gollnitz—Gordon identities [3, 4], which we provide here for reference: 


Theorem 1.1 (Géllnitz—Gordon Identities) Fix a to be either 1 or 3. Given an 
integer n, the number of partitions of n in which parts are congruent to 4, ta 
(mod 8), is equal to the number of partitions of n in which parts are non-repeating 
and non-consecutive, with any two even parts differing by at least 4, and with all 
parts = a. 


Each identity—one for either value of a—equates the sizes of two different 
classes of partitions of n, while not actually indicating the class size itself. We will 
use Hardy and Ramanujan’s method, together with Rademacher’s refinements, to 
formulate a convergent expression for the number of partitions associated with these 
identities. 


Definition 1.2 Fix a at either 1 or 3. A Gdllnitz—Gordon partition of type a is 
composed of parts of the form 4,--a (mod 8). The generating function for such 
partitions is expressed as F,(qg), and the actual number of such partitions of n is 
given as g,(n). 


We seek a formula for g,(n). The author wishes to note his deep appreciation for 
the guidance and encouragement of Professor Andrew Sills, who first suggested this 
problem. 

In keeping with the theory of q-series [1, Chapter 2], we have 


Fug) =) galk)g (1) 

k=0 
= I] ( = | ids aa © 7 grad = go (2) 

m=0 
= : (3) 

PlGF PoE Ai Pe 
with 

(4: Joo = | [ (1 aq’). (4) 


j=0 
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Cauchy’s residue theorem [14, Chapter 3] gives us a means of calculating—at 
least in principle—the value of g,(n). Dividing F,(qg) by q"*!, we find that ¢,(n) is 
the coefficient of g~!, and is therefore the residue of F,(q)/q"*!. 


Theorem 1.3 


_ 1 f Faq) 
8a(n) aa ni grt dq, (5) 
Cc 


for C some curve inside the unit circle of the q-plane, encompassing q = 0. 


We must choose an appropriate contour for C. We then study F,(q) itself, 
including some of its useful transformation properties. Next, we will employ the 
circle method in reducing our integral (5) to something far more accessible to 
integration. We finish our integration using the theory of Bessel functions. 


Rademacher’s Contour 


Casual inspection of (3) suggests that F,(q) has important structure near the roots 
of unity of the unit circle. We will construct a contour that remains inside the unit 
circle, but approaches the roots of unity e?7/* in a controlled way. This contour 
was first used by Rademacher [10]. We use the contour in a form slightly modified 
by Sills [13]. 


Definition 2.1 For a given h/k € Fy, define the Ford circle C(h,k) as the curve 
given by 


a ee ) 
ele oes| ee 


Given the set of Ford circles corresponding to the Farey sequence of degree N, 
let y(h, k) be defined as the upper arc of C(h, k) from 


(h, k) : a eee 
t(h,k) = -— i 
: k ke +R)" P+ 
to 
h kg 1 
tr(h, k) = 


k’ e+e) | eR” 


with h,/k, and h,/k, the immediate predecessor and successor (respectively) of 
h/k € Fy (let 0,/1, = (N — 1)/N; similarly, let (NV — 1),/N, = 0/1). 
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) 1/3 1/2 2/3 1 R(r) 


Fig. 1 Ford circles C(h, k) for h/k € F3, with P(3) highlighted 


Definition 2.2. The Rademacher path of order N, P(N), is the union of all upper 
arcs y(h,k) from t =itot =i+1: 


PIN) = (J yh,b). (7) 


h/ke Fn 


We give an illustration of P(3) in Fig. 1. 

It may be easily demonstrated that consecutive Ford circles corresponding to Fy 
are tangent to one another, so that P(N) is a connected curve. Moreover, for t in the 
upper arc y(h, k), S(t) > 0; therefore, y(h, k) lies entirely in H for every h/k € Fy. 
Therefore, P(N) is a connected curve that lies entirely in H. 

So if we define gq = e?'", then we may define our curve C from (5) as the 
preimage of P(N). We will make one more helpful change of variables: 


mee (8) 
eS Et ie 
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with (z) > 0. This change maps C(h, k) (with y(h, k)) to the circle 


(9) 


Notice that the initial and terminal points of y(h,k) are mapped to z(h, k) and 
zr(h, k) by the following: 


k ke. 
(hk) + zi(h,k) = re ra ah (10) 
P P 
k ks 


tr(h, k) +> zr(h,k) = (11) 


e+e P+eR” 
We finish this section by referencing an important lemma, which can be proved 
quickly from the properties of the Farey fractions [6]. 


Lemma 2.3 Let N € N be given, withh/k € Fy. Let z(h,k), zr(h, k) be the images 
of t(h, k), tr(h, k), respectively, from C(h, k) to ome Then for any z on the chord 
connecting z;(h, k) to zr(h,k), we have 


lz] = O(N). (12) 


Transformation Equations 


We begin by expressing F,(qg) in terms of automorphic forms—in particular, as 
a quotient of eta functions by a theta function. Let g = e?”", with t a complex 
variable, S(t) > 0. 

Recall that Ramanujan’s theta function [2, Chapter 1] has the following product 
expansion: 


fg —@) = (Gt) 00g? TP 00 (q" 78; gt Peo. (13) 


Moreover, Ramanujan’s theta function is related to the standard theta function 
v, by the following, which can be verified by the series representations of both 
functions [11, Chapter 10]: 


f(-a", -4?) = —ieC2-P/49, (at |(w + B)r). (14) 
We then have 
Ga 
F,(q) = 15 
@ (41; q oof (-4*, -4°™) = 
CaaS 


=lexp(zit(2 —a)) (16) 


(q*: q+) oot (at |8t) | 


378 N. Allen Smoot 
Since (q*; q”)oo = €7""/!? (at), we can rewrite the remaining g-Pochhammer 
symbols in terms of eta functions in the following way: 


n(8t)* 


n(4t)91(at|8t) (17) 


Fa(q) = iexp(zit(1 — a)) 


We will now study the behavior of F,(q) near the arbitrary singularity e?7"/*, 
with 0 < h < k, and (h,k) = 1. To do this, we will divide our work into four 
cases, depending on the divisibility properties of k with respect to 8, and then take 


advantage of the modular symmetries of the 7 and 7; functions. 


GCD({k,8) = 8 


The simplest transformation formula relevant to our problem occurs for (k, 8) = 8. 
Let Hs be defined as the negative inverse of h modulo 16k: 


hHg = —1 (mod 16k). (18) 


Notice that since 8|k by hypothesis, and (h,k) = 1, therefore (h,16k) = 
(h, k) = 1, so that Hg exists. Then the following are elements of SL(2, Z): 


h —§(hHs + 1) 
k 
(59) r 
_4 
a Hs 
We will allow 
(age (21) 
T= i ee 


Applying (19) as a modular transformation to 8t’, we have 


Shr’ — §(hHs + 1) 
= 8r. 
8 8 ti — Hg 
Similarly, applying (20) to 4t’, we get 4r. 
Therefore, we will transform 7(87) to 7(8t’), using (19). Similarly, we transform 
n(4r) to n(4t’) using (20). 
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Invoking these transformations, we must contend with the roots of unity associ- 
ated with the 7 and 1; functions. As a shorthand, we will refer to the roots of unity 
as the following: 


<(8,8) =e @ —S (it 451), > -#) (22) 
4 k 
«(8,4) =e @ =; (hits + 1), 7, -#) (23) 


where €(a, b, c, d) is the root of unity given by 


d) s(1—c)/2 Zi (bd(1 — 2 5 
€(a, b,c, d) = A aap (ia Oat he ame fe (24) 
(S)exp (42 + Hlac(l—d?)+d(b—c))), 24d 
and (“) is the Legendre—Jacobi character. See [11, Chapter 9]. 
Invoking the functional equation for 7 [11, Chapter 9], it follows that 
8t)? 1 €(8,8)? n(8t’)? 
n(8t)" 1 €(8, 8)" n(8r') (25) 


n(4t) —z!/2 €(8,4) n(4t’) 


Handling %, turns out to be more difficult, due to the presence of a second 
complex variable. We will mimic our work with 7(87), using (19), and setting 


4 a(hiz~! — 1) 
v=atiz. = 4 


(26) 


The functional equation for 7; [11, Chapter 10] gives us 


9; (at|8r) = n(2 
1Z 


I co thectnag 
sr} = —i€(8, 8) ere TDF, (v]8r’). 27) 
z 


Recall that hHg = —1 (mod 16k). We can therefore write 
—1 = hHg + 16kM, (28) 
with M € Z. We then have 
v = aht' + 16aM. (29) 


If we also take advantage of the fact that 3) (v + 1|r) = —0i(v|r) [11, Chapter 10], 
then we have 


D) (v|8c’) = B) (aht’ + 16aM|8t') = ; (aht'|8t’). (30) 
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Again considering that (k,8) = 8 and (h,k) = 1, anda = 1,3, we also have 
ah = 1,3,5,7 (mod 8). We therefore write 


01 (aht'|8t') = Wy (bt + 8Nt'|8r’), (31) 


with b the least positive residue of ah modulo 8. 
We now make use of the fact that for N € N, 


d1(v + Nt|t) = (—1)% exp (—ziN(2v + Nt)) 34 (v|t) (32) 
[11, Chapter 10], so that 
31 (aht'|8t’) = (—1)" exp(—miN(2bt' + 8Nt’)) 9, (bt'|87’). (33) 
Combining (27), (30), (33), and inverting, we have the following: 


1 CD yo ena hie!—1)2 2k EXP (TIN (2bt' + BN’) 
=1 e~ : 
01 (at|8t) €(8, 8)? 7 01 (bt’|8t’) 


(34) 


We now have sufficient information, in (25) and (34), to reassemble the trans- 
formed generating function. 


ne 1 (8,8) (8? 
PN POPE 2) ee aay ey oan eD) ro 


x exp(ziN(2bt’ + 8Nt’)) a BEY (36) 
n(4t/)3 (bt'|8t’) | 
Here b = 1,3,5, 7. However, noting from (13) that 
f(-q",-¢°) =f(-¢?.-4"), (37) 


we may define F'5(q) = F3(q), F7(q) = F\(q). We therefore have 


“/__1\N 
F,(q) = aaa x exp (aici —a)— zna’(hiz! _ 1)? /8k 


+ wiN(2bt’ + 8Nt’') + wit’ (a—- 1)) Fa. (38) 


with y = exp(2zit’). 
Remembering that 


v= |S|-"=. (39) 
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and that 
a —4a+3 = (a—1)(a—3) = 0, (40) 


we may collect and reorganize the coefficients of 1, z, and 1/z in the exponential 
of (38). Doing so gives the following transformation formula: 


— 4/2 _ 
MN Stabse @ (2 9a 5))) FQ). (1) 


where 
i(—1) 14! 


_ i _ _ _ Rr 
ag(h,k) = (8. 8)¢(8.4) exp (= (h(5 — 4a) — Hg((b — 4) s)) ; (42) 


We note that we can extend this result to prove the modularity of F,(q) relative 
to a certain subgroup of SL(2, Z). We do not give the proof here. 


GCD({k, 8) <8 


The result of the section “GCD(k, 8) = 8” suggests that F,(qg) is modular, at least 
with respect to a subgroup of the modular group. While such a property does not 
carry over exactly to the remaining three cases, it is only necessary to show that 
F.i(q) = f(z) Vy), with Y(q) a suitable quotient of g-series. 

For each case (k, 8) = d, we will define 


fa THY _ (43) 
where 
= =~—1 (mod k/d), (44) 
and 
y= eonit’ (45) 


We consider the following matrices, which can easily be shown to be in SL(2, Z): 


8h/d —4(8hH,/d + 1) 
C=), “ 


Ge —4(8hHy/d + ”) (47) 
k/d —2H4 , 
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We also define 


=e (sure. —£(8htta/d ii, 4 1) . (48) 


e(d,4) =e (41a —£(8htta/d + 1), 4 2H) ‘ (49) 


with €(a, b, c,d) defined by (24). Remembering (44), we also let 


da(hiz—! — 1 M 
pe ge (50) 
8k 
with M € Z. Finally, we write 
7 iad (8hHa | i 
FE EN die \. od 
GCD(k,8) = 4 
With d = 4, we apply (46) to 4r’, and (47) to 87’, so that we have 
8r)? 1 €(4,8)? n(4r’)? 
(Br)? _ 1 €(4,8) n(4r’) a 


n(4t) — 2z!/2 €(4,4) n(8t’) 


As with the case of d = 8, # requires the most work by far. The initial 
transformation through (46) gives us 


1 


za? (hiz—!—1)*/8k ! 
Gain? Bi (vi4e’). (53) 


01 (at|8t) = —ie(4, 8)° 


And 


Dy (v/4c’) = (aie + > 


1’) : (54) 


We may now allow b = ah (mod 4), letting ah = 4N + b, so that (54), together 
with (32), gives 


D1 (v|4e’) = (—1)% exp(—wiN(2t'(2N + b) + aM/4)) 


M 
xd; (oe + =~ 


1’) . (55) 
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We now shift from #; to 34 [11, Chapter 10): 


0, (v|t) = iexp(—wit/4 — wiv)04(v|T). 


Write 
v, (o0' + ail a’) =0 (o- 2)t' eapies 7 M an! At ‘ (56) 
=iexp(—zi((b — 1)t’ + aM/8)) (57) 
x 04 (« =2)c" 4 le a’) (58) 


We now express 4 as an infinite product [11, Chapter 10]: 


%(o- 2)t’ pean lac ‘) = I (l- yi" = aa = eo) 


= (5S es One 1¥ Mock Pe OY Jos: (59) 


Combining (52), (53), (55), (58), (59), and simplifying, we have 


j= ode (= (- ce 5))) Weald), (60) 


af 2. 
with 
(q*: q*)3, 
Wa, = , (61) 
@) (@°: goof (—paag?; — 749°”) 
and 
= i(-l4! 
aah, = Ta De(a,4) 
x exp (Eu — Hy —h(4a — 3)(hHy + 1) + aQhHy + I(b— 2) . (62) 
GCD(k,8) = 2 


With d = 4, we apply (46) to 21’, and (47) to 41’, so that we have 


n(8t)? ss Ls € (2,8)? n(2t')? 
m(4t) — 2/2z!/2 €(2,4) n(4t’) 


(63) 
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Once again, %; requires the most work by far. The initial transformation 
through (46) gives us 


1 
1 (at|8r) = —ie(2, Tl '~I?/8k9, (y|2z’) . (64) 


And 


DB (v[20’) = (aie + =~ 


20’) ? (65) 


Notice that both a and h are odd. We may therefore write ah = 2N + 1, so that 


M 
BD, (v|2c’) = (—D% exp(—wiN(2t' + aM/4 + 20'N)) 0, (" ope 21') . (66) 
We now shift from 7, to 34. Write 
M —Ii M 
o, G Bs > 21') = iexp (Faw = am) 4 (+ 2’). (67) 
We express v4 as an infinite product: 
aM 
04 (+ 21') =yi¥ ol piaviy ed Oh oe (68) 
Combining (63), (64), (66), (67), (68), and simplifying, we have 
Fa(q) = earth, R)exp (2 (<(4a~ 5) Waa) (69) 
a = >= Wa »K) eX aL a — a , 
q We 2 p 8k z 2 
where 
2. g2y2 
Woe (70) 
(1°: D oof (—Pa,24: —Pa2M) 
and 
i(-1) Lt 


@a2(h,k) = (= (1 — (4a — 3) (2hHy + »)) 4496 


€(2, 8)€(2,4) 
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GCD(k, 8) = 1 
With d = 4, we apply (46) to 21’, and (47) to 47’, so that we have 


n(8t)> Le (1, 8)? n(t’)? 


= : 72 
n(4t) = 4z!/2 €(1, 4) n(21’) ee 
Returning to v1, 
= —ie(1 3 8arkzv? / ; 7 
0 (at|8r) ie(1, 8) WEI D1 (v|t’) (73) 
And 
M 
B, (v|t’) =v; (ane + > “) : (74) 


Recognizing that we may extract aht’ altogether from our first variable, and 
recognizing that (—1)“’ = (—1)", we have 


aM 


9, (v|z’) = (-1)" exp(—ziah(aht' + aM /4))0; ( ; 


“) . (75) 


aM 


We may now write (# 


M 
a (+ 


tT! ) in its classic product form [11, Chapter 10]: 


8 


“) = 2e7"'/4 sin(ztaM/8) 
oe) 

x I] d= ee -_ ie een _ ue a) (76) 
m=1 


= 2e7!*'/4 sin (aM /8)(y; Y)00(Pa,1 3 Yoo (Pa 1Y; Yoo: (77) 


Examining the sine function, let aM = 8N + c, with c the least positive residue 
of aM (mod 8). Then 


. mwaM ve. (He 
sin (7) = (—1)”" sin (=) : (78) 
Notice that sin ea) > 0. We know that since 


1 
M = —~(8hH + 1), (79) 
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and since (k, 8) = 1, therefore 
c = —ak"! (mod 8). (80) 


Moreover, k is odd, so kK~! = k (mod 8). So 


. mak 81 
sin (=)| , ( ) 


Combining (72), (73), (74), (75), (76), (77), (78), and simplifying, we have: 


ak 1 
F,(q) = we ath, k) csc (= JJex (= (;. a z(4a = 5))) Wii0), (82) 
where 
0; Y)oo 
Ww = : 83 
al? (975 ¥7) 00 (Pa,1¥3 Y00 (0713 Yoo ae 
and 
7) |) 1 
on, (h,) = 
, €(1, 8)e(1, 4) 
x exp (Fen —a+hH,(3 —- 4a)) -— i) . (84) 
Integration 


Recall from section “Introduction” that 


1 F,(q) 
Oni ght} 


8a (n) = dq, 


while in section “Rademacher’s Contour” we described a contour for C that will 
prove useful for integration. We will now begin the integration proper. 

Let N be some large positive integer, and let the corresponding Rademacher curve 
P(N) be given. Then we have the following: 


F,(q) 1 / F4(q) 
, dq = ue dq. 
8a(n) = sah qt 4 5 Oni gn 4 (85) 
: 
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In section “Transformation Equations”, we gave transformation equations for 
F,(q) depending on the divisibility properties of k. We now separate our integral 


into the corresponding cases: 
Ban) = gf (n) + Bh (n) + 8m) + 80 (n), 


with 


&M= > Ya f eta 


(k,8)=d, 0<h<k, 
k<N  (hk=1 ars k) 


In each case, we will tranform F,(q) by the following: 


gO(n) = > : > e2rinh/k 


(k,8)=d O<h<k, 
k<N (hK)=1 


zr (hk) h . 
: i 27nz/k 
x Fy 2, -~+— d. 
| (#°( “(7+ 7)))¢ ‘ 
zr(h,k) 
—7(@-3)/2 > _Taalk) = aah, Ke 2rinhlk 
(k,8)=d 0<h<k, 
k<N (hk)=1 
zr(h,k) rn q 
x | exp (= (AS ied) + z(16n + 4a — 5))) W,a)dz, 
z(h,k) 


with w,q(h, k) defined by (42), (62), (71), (84), Yaad) = 
as F;,(y) for d = 8, and (61), (70), (83), otherwise (note that w,,,(0) = 1); 


(b—4)?-8 ifd=8 


1 ifd=4 
A(a,d) , 
ifd=2 
1/4 ifd=1. 
a = log,(d), 
and 
esc(zak/8)| ifd=1 
Lia | 
1 otherwise. 


(86) 


(87) 


(88) 


(89) 


© Vaalj)y’ defined 


(90) 
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Thereafter, 


g(n) = 26-9? > ; ak) y Wa a(h, Ken 2tinh/k 


(k,8)=d O<h<k, 
k<N (hb=1 
x (1ia(h kb) + In), OV 
where 
ar(h.k) ee 
1(h, j= / exp (% ( (a4) + z(16n + 4a — 5))) dz, (92) 
, Zz 
z(h,k) 
and 
zr(h,k) oe as 
1 (h,k) = / a (OE pied 5) S- VaaGy'dz. (93) 
: 8k Zz = 
z(h,k) = 


In each of our cases, we will show that me k) will contribute nothing to our 
final formula. 


Lemma 4.1 Ford = 8,4,2, 1, 


S aah, be _ O (IPB +e y'/3) . (94) 


0<h<k, 
(ak)=1 


This result can be shown through Kloosterman sum estimation, using the 
techniques of Salié [12]. 


Lemma 4.2 


1y(h, 8)| = O (exp@naN-'). (95) 


Proof We may interchange the summation with the integration. Also, remembering 
= - (Ha . diz! 
that y = exp (27i (4 Fe )). 


1"(h, k) 


Cc 
= Po baa tal 


j=l 
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er(h,k) 
A(a.d _ 
x / exp (= ( (a, ” + s(46n + 4a ~ 5)) ) 2 kar (96) 
z 


8k 
zy(hk) 


[oe 
=o baa en el* 


i=l 


zr(h,k) 


x / exp (= (AS2-" + (16n + 4a—5))) dz. (97) 


zy(h,k) 


Notice that no matter the permitted value of d, the coefficient of 1/z in the 
exponent of the integrand is now always negative. 

Taking advantage of the fact that on and within om , RCA/z) = &k and R(z) < 
1/k, we now examine the magnitude of the integrand: 


exp (Z “a + 6n-+4a—5)))| (98) 
=exp (Sa, d) — 2dj)R(1/z) + a (lon + 4a —5)2(@)) (99) 
cosp( ») | m(16n ee an 
<exp(—mj/8 + 3nz). (101) 


We therefore have 


Mat. &)| 


ioe) 
< y- |Wad (j) | Jem iail™ | 


j=l 
zr(h,k) . 
x exp (Z (“S°-" + <(06n + 4a~5))) dz (102) 
z(h,k) 
aS zr(h,k) 
Do WaaDlexp(—nj/8-+ nn) fae (103) 
i= a(hk) 


Recall that we are integrating along the circle oe in the z-plane. We will now 
deform our contour so that it is a chord connecting z; and zy along K; 7 (Fig. 2). 
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Fig. 2 x with the chord connecting z;(h, k) to zr(h, k) 


Recognizing from Lemma 2.3 that the length of such a chord is bounded above 
by aconstant multiple of N~!, we have 


B= O[ > \Waa@|exp(—xj/8 + 3nx)N7! (104) 
j=l 
= O| exp(3nz)N7! )* |Waa(j)| exp(—7/) (105) 
j=l 
= O(exp(3nm)N~!). (106) 
oO 


Lemma 4.3 Let « > 0. Then 


i : 
yla-3)/2 y ; 1d (k) y aah, Ke rT k) 
(k,8)=d O<h<k, 
k<N (h,k)=1 


_ O(e* nl3n 3.) (107) 
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Proof We note that since 2°—%)/? and T,,q(k) are bounded, we may disregard both 
in our estimation. We now take the previous result into account: 


i —2z in. 
DE de Maalh eT (A, b) 


(k,8)=d O<h<k, 
k<N (=I 


1 
< bz ae a Waa(h, pe es . (108) 
(k,8)=d O<h<k, 
k<N (A,k)=1 


With Lemma 4.2, we know that 


Yate Solera”). (109) 


0<h<k, 
(AK=1 


This gives us 


i —2zin 
DE De Mall ue Th, k) 


(k,8)=d  O<h<k, 
k<N (hk=1 


1 3nx 1.2/3+€ 1/3 3nz 1/3 n7—-1 = feet 
=O > ae Rent) | = Of le" nN : . (110) 
(k,8)=d k=l 
k<N 
Recognizing that 
N N N N 
k2/3+¢ f2/3+2¢€ N2/3+2€ ; 1 
= = /3+2€ 
ps ko Ds kite = Ss kite x 2D kite’ (1) 
k=1 k= k=1 k=1 


that ys aK is bounded above as WN gets large, and finally noting that we may 
replace 2€ with €, we now have 


°( 


and the proof is completed. oO 


N f2/3+¢ 


k 


ee" pl/3nyT! 
k 


= O(e?"* nBNV3+ey, (112) 
c=1 
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We now have 


= i a 
Bin) = 20-9? SY Taalk) YY @aalh, he AT Gh, b) 
(k,8)=d O<h<k, 
k<N (h,k)=1 


aa Ofer anne) : (113) 


Our object now will be to put rhe (A, k) into a form approachable from the theory 
of Bessel functions. 

We now return to the original Rademacher contour of iy (h, k), along a portion of 
K . The brilliance of the contour becomes clear once it is realized that #(1/z) = k, 
i.e. is a constant, provided we remain along K, - (and avoid z = 0, of course). We 


wish to make use of the whole of Ke so we will make adjustments to the contour 
as follows: 


zr(h,k) 0 


wmnn=($.-f - | 


0 zr(h,k) 


exp (3 (AS + <(16n + 4a — 5))) dz. (114) 
& 


z1(h,k) 0 
Notice that f{ and / are improper: the integrand is not defined at z = 0. 
0 zr(h,k) 
We interpret these integrals as limits in which a variable approaches 0. We will now 
z(h,k) 0 
show that f and will not contribute anything of importance: 
0 zr (hk) 


Lemma 4.4 


0 
1 
iA exp (= (- + z(16n + 4a — 5))) dz| , 
8k \z 


7 (h,k) 


ar(hk) 


/ exp (= (“S se + z(16n + 4a — 5))) dz 
0 


8k 


= O(exp(3nm)N~'). (115) 
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Proof We will keep on kK for these estimations. Since the estimation is almost 


0 
identical in either case, we will work with the integral  . We begin by estimating 


zr (hk) 
the integrand of the integral: 


exp (= (AS + z(16n + 4a — ))| 


= exp (= (R(1/z) + H@)(6n + 4a — 5))) 


2 (5 («44 *)) 
< exp 
8k k 


mz  wm(l6n+ 4a—5) 
< exp 8 ae 


8k? 


< exp(3nz). 


We now estimate the path of integration: 


(116) 


(117) 


(118) 


(119) 


The chord connecting 0 with zr(h, k) can be no longer than the diameter of K™, 


so the length along the arc from 0 to zr(h, k) can be no longer than 


zr(h, k)|>. Since 


ler(h, k)| < /2/N, we have a path length that is O(N~'). This gives us 


0 
/ exp (< (= d + z(16n + 4a — 5))) dz 


T (h,k) 
0 
x [{ A(a,d) 
< / exp {| — | ——— + 2(16n+ 4a—5) 
8k Zz 


zr(h,k) 


< exp(3nz) / dz 
er (h,k) 
= O (exp(3nm)N'). 
zr(h,k) 


The case for fis virtually identical. 
0 


dz (120) 


As a consequence of the previous Lemmas 4.1, 4.2, 4.3, and 4.4, we have 
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Theorem 4.5 


: i _ 

g(n) = Q@ 3)/2 > ; ".a(k) > Wa ah, ke 2m inh/k 
(k,8)=d, O<h<k, 
k<N (A,k)=1 


A(a,d 
«$.,20(5 ( “ae Le c(16n + 4a~5))) 


a ON et) : (123) 


Estimating g® (n) 


In the case of d = 8 we can discard a large portion of what remains. Notice that 


er (hk) 
(1) us (b = 4)? = 8 
Tg(h, k) = exp 7 + z(16n + 4a —5) } } dz, (124) 
Zz 
zr(h,k) 
with b = ah (mod 8). If b = 1,7, then the coefficient of 1/z in the exponent is 1. 


However, if b = 3,5, then the coefficient is —7, and by almost identical reasoning 
of Lemmas 4.1, 4.2, and 4.3, applied to J ce (h, k), we have 


High, k)|=0 (exp(3nx)N7!) . (125) 


Now @ = 3 and T,,a(k) = 1. Since b = 1,7 implies h = ta (mod 8), we have 


g®) (n) = ; bs Wath, ken 2inh/k 


(k,8)=8 O<h<k 
k<N (A,k)=1 
h=+a (mod 8) 


1 
x $.. exp (= (- + 2(16n + 4a — 5))) dz+ O(c" nl/3nN-'3+") (126) 
k 


Estimating g (n) 


Lemmas 4.1, 4.2, 4.3, and 4.4 are sufficient to complete the estimation of gl? (n). 
With a = 2 and T, 4(k) = 1, we have 
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Theorem 4.6 


1 i : 
(4) a = h.k —2minh/k 
P= YL ED euath. be 


(k.8)=4, | O<h<k, 
k<N (hkK)=1 


1 
x exp (= (= + :(101 + 4a~5))) dz 
KO? 8k x. 


a. Oe" aN Es) ; (127) 


Estimating g? (n) 


In the case of d = 2, the coefficient of 1/z in the exponential of the integrand is never 
positive. Therefore, we may immediately apply the reasoning of Lemmas 4.1, 4.2, 
and 4.3 to both 1 (h, k) and 1" (h, k): 


(A, | = 3 (h, | = O (expGnx)N7!). (128) 
Therefore, 
Theorem 4.7 
g(n) = Oe nen Pe) . (129) 
ios (1) 
Estimating g, (n) 


Lemmas 4.1, 4.2, 4.3, and 4.4 are sufficient to complete the estimation of gi’ (n). 
Noting that a = 0 and 7, 4(k) = | csc(aak/8)|, we have 


Theorem 4.8 


1 i mak ; 
() = —2minh/k 
gy (nb = ) ese ( ) ) a1 (h, ke 
2/2 (k,8)=1 k 8 0<h<k, 
k<N 


(f,.k)=1 


Ot! arténtde—%) \ \a 
x >, O*P Sk at n+ 4a—5) be 


a O(c?" n' BN 1/36) . (130) 
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Complete Formula 


Combining (126), (127), (129), (130), and collecting the error terms, we have: 
a(n 
a(n) = 5 5  ; 


k 
ese (“ ) oat) 
«8) 8)= ik 8 


1 
“fo (F(z + (06n + 4a~5))) de 


+ ae Aral DQ, ex (= (- + (06n + 4a~5))) de 


(k,8)= a 
k<N 
1 
+ yy Dost Dg, en (Z(- + (16n + 4a~5))) dz 

K8)— 3K 

+ 0 (enn l3te) | (131) 
with 

Aga = >)  agalh, err, (132) 


O0<h<k, 
(h,k)=1, 
h=a (mod d) 


We represent the remaining integrals with modified Bessel functions [16]: 


Lemma 5.1 


1 
§.. exp (= (- + 2(16n + 4a — 5))) dz 
K;, z 


—2ni V16 4a — 
ai I (: n+ 4a ), (133) 


Jiti-aa=5 4k 


exp (= (= + (16n + 4a~5))) de 
Ky 


—Ti V/ 16 4a — 
_ a h oA n+4a—5 (134) 
J/16n + 4a—5 8k 
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The first equality may be proved by changing variables, first by z = 1/w, and 
then by w = 8kt/z. We may then represent the integral with the modified Bessel 
function J; [16]: The second equality may be similarly proved by changing variables 
by z = 1/w, and then by w = 32kt/z. 


Finishing the Limit Process 


We now take (131), with A, 4(u, k) defined by (132), substitute and simplify through 
Lemma 5.1, and let N — oo. We now have our final formula. 


Theorem 5.2 Let g,(n) be the number of type-a Géllnitz-Gordon partitions of n, 
with a = | or 3. Then 


(*)| Agia), (= + 4a— *} 
1 


qQht 
Ba(n) = w= = k 8k 


3 ss k), (=) 
+ ai 3 eae , Ak 


Qn Agg(n, k) m/16n + 4a—5 
de 2 ; if ak . (135) 
V16n+ 4a—5 (.=8 


Numerical Tests 


Mathematica was used to test (135), for k truncated. See the tables below (Tables | 
and 2). For the first 200 positive integers, our formula with k < 3./n gives the 
correct value with an absolute error less than 0.33. Since g,(n) € Z, we need only 
round our formulaic value to the nearest integer to achieve the correct answer. 
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Table 1 g,(n) compared to (135) truncated for k, with a = 1 


N. Allen Smoot 


n gi(n) Eq.(135),a=11<k< 3./n Absolute error of (135) 
1 1 0.7784305652 0.2215694348 
2 1 0.7196351376 0.2803648624 
3 1 1.114485490 0.114485490 
4 2 1.890769460 0.109230540 
5 2, 2.146945231 0.146945231 
6 2 2.174897898 0.174897898 
7 3 2.917027886 0.082972114 
8 4 3.994864237 0.005 135763 
9 5 4.903833678 0.096166322 
10 5 5.108441112 0.108441112 
20 26 26.07125673 0.07125673 
40 288 287.9388309 0.0611691 

60 1989 1988.942843 0.057157 

80 10,570 10,569.99993 0.00007 

100 47,091 47,090.99132 0.00868 

150 1,191,854 1,191,853.996 0.004 

200 18,900,623 18,900,622.99 0.001 
Table 2 g3(n) compared to (135) truncated for k, with a = 3 

n g3(n) Eq.(135),a=3,1<k< 3./n Absolute error of (135) 
1 0 0.290887 1603 0.290887 1603 
2 0 0.1385488254 0.1385488254 
3 1 0.8129880460 0.1870119540 
4 1 0.9584818018 0.0415181982 
5 1 0.8666320258 0.1333679742 
6 1 0.9177374697 0.0822625303 
7 1 1.323340028 0.323340028 
8 2 2.095679009 0.095679009 
9 2 2.042654099 0.042654099 
10 2 1.953812941 0.046187059 
20 12 12.01649403 0.01649403 
40 127 126.9760443 0.0239557 

60 865 865.0090307 0.0090307 

80 4560 4560.002784 0.002784 

100 20,223 20,223.00416 0.00416 

150 508,454 508,454.0481 0.0481 

200 8,034,534 8,034,534.006 0.006 


A Partition Function Connected with the Gollnitz—Gordon Identities 399 


References 


15. 


16. 


. G.E. Andrews, in The Theory of Partitions. Encyclopedia of Mathematics and Its Applications, 


vol. 2 (Addison-Wesley, Reading, 1976). Reissued, Cambridge, 1998 


. B.C. Berndt, Number Theory in the Spirit of Ramanujan (American Mathematical Society, 


Providence, 2006) 


. H. Géllnitz, Partitionen mit Differenzenbedingungen. Journal fiir die reine und angewandte 


Mathematik 225, 154-190 (1967) 


. B. Gordon, Some continued fractions of the Rogers-Ramanujan type. Duke Math. J. 32, 


741-748 (1965) 


. G.H. Hardy, S. Ramanujan, Asymptotic formule in combinatory analysis. Proc. Lond. Math. 


Soc. 2(17), 75-115 (1918) 


.G.H. Hardy, E.M. Wright, An Introduction to the Theory of Numbers, 6th edn. (Oxford 


University Press, Oxford/New York, 2008) 


. J. Lehner, A partition function connected with the modulus five. Duke Math. J. 8, 631-655 


(1941) 


. L Niven, On a certain partition function. Am. J. Math. 62, 353-364 (1940) 
. H. Rademacher, On the partition function p(n). Proc. Lond. Math. Soc. 43(2), 241-254 (1937) 
. H. Rademacher, On the expansion of the partition function in a series. Ann. Math. 44(2), 


416-422 (1943) 


. H. Rademacher, in Topics in Analytic Number Theory. Die Grundelhren der Mathematischen 


Wissenschaften, Bd. 169 (Springer, Berlin/New York, 1973) 


.H. Salié, Zur Abschatzung der Fourierkoeffizienten ganzer Modulformen. Mathematische 


Zeitschrift 36, 263-278 (1933) 


. A.V. Sills, Towards an automation of the circle method, in Gems in Experimental Mathematics. 


Contemporary Mathematics, vol. 517 (American Mathematical Society, Providence, 2010), 
pp. 321-338 


. E.M. Stein, R. Shakarchi, in Complex Analysis. Princeton Lectures in Analysis, vol. 2 


(Princeton University Press, Princeton, 2003) 

R.C. Vaughan, The Hardy-Littlewood Method, \st edn. (Cambridge University Press, 
Cambridge, 1981) 

G.N. Watson, A Treatise on the Theory of Bessel Functions, 2nd edn. (Cambridge University 
Press, Cambridge, 1944) 


On Toeplitz Operators with Quasi-radial 
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Abstract We explore a new wide class of symbols that generate commutative 
Banach algebras on each weighted Bergman space on the unit ball in C”. These 
symbols are a natural extension of the previously studied quasi-radial quasi- 
homogeneous symbols, and contain them as a very special particular case. Roughly 
speaking, instead of the fixed specific bounded continuous functions we admit now 
any Loo-functions. 
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Introduction 


After a successful description of commutative C*-algebras generated by Toeplitz 
operators on the unit ball B” [6, 7], it was unexpectedly observed that for n > 2 there 
exist many others algebras generated by Toeplitz operators that are commutative 
on each weighted Bergman space. All of them were Banach (not C*), and their 
description was done in terms of generating Toeplitz operators. The first result in 
this direction [9] deals with the so-called quasi-radial quasi-homogeneous symbols. 
The subsequent results [1, 2, 8] were based on similar ideas exploiting a quasi- 
homogeneity of corresponding symbols. Note that the notion of quasi-homogeneous 
symbols is based on the use of the spherical coordinates. 
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Quite recently [5], for the case of two-dimensional ball B*, the much more 
wide class of quasi-homogeneous like symbols was described. These symbols also 
generate via corresponding Toeplitz operators commutative Banach algebras on 
each weighted Bergman space, and their existence was hidden just by the use of 
the spherical coordinates in the approach of [9]. 

This paper extends the results of [5] for the case of the unit ball B” with 
n > 2, and explores a new wide class of symbols that generate commutative Banach 
algebras on each weighted Bergman space on these balls. We call these new symbols 
pseudo-homogeneous, they include the previous quasi-homogeneous symbols as a 
very special particular case. Roughly speaking, instead of a fixed specific bounded 
continuous functions we admit now any Loo-functions. 

Section “Preliminaries” collects notation used throughout the paper. In 
section “Quasi-radial and Pseudo-homogeneous Symbols” we introduce pseudo- 
homogeneous symbols and describe the action of Toeplitz operators with 
quasi-radial and pseudo-homogeneous symbols. Section “Commutative Algebras” 
devoted to the description of commutative Banach algebras generated by Toeplitz 
operators with above symbols and of some their common properties. Yet another 
option to build commutative Banach Toeplitz operator algebras is presented in the 
last section “Yet Another Option”. 


Preliminaries 


Let B” be the unit ball in C”, 


B" = {z= (41,...,.2n) €C": [zl? = Jal? +... 4 lel? < 1, 


and let S?”—! be the corresponding (real) unit sphere, the boundary of the unit ball 
B". 

In what follows we will use the notation t(B”) for the base of the unit ball B”, 
considered as a Reinhard domain, i.e., 


r(B”) = f(r, -..57%m) = Cal... len) ¢ PHA +... +72, € [0, Dh. 


We denote as well by B” the real m-dimensional unit ball, 


B” = {x = (x,...,%,) € R”: Paxait...tx¢ <1. 


Then, of course, t(B”) = B” 1 RY =: BY. 
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Given a multi-index @ = (a ,Q2,...,Q,) € Z we will use the standard 
notation, 
a! = ay!ao! +--+ a,!, 
ag a ee, 


for p = (p1,p2,---,Pn) € Z" we set 


IpPl=pit+po+..-+Pn and — |[pll = [pil + [pal +... + [pal- 


Denote by dV = dx,dy, ...dx,dy,, where z) = x; + by, 1 = 1,2,...,n, the 
standard Lebesgue measure in C”; and let dS be the corresponding surface measure 
on S?"—!, We introduce the standard one-parameter family of weighted measures, 


_ P@+at) yaya _ 
dv;(z) = wTat)D (—|z*)*dV(z), A>-1, 


which are probability ones in B”; and recall (see, for example, [10, Section 1.3]) that 


alP(n+A+1) 
“8 Pin+ lal +A+ 1) 


/ 2 dv,(z) = 8 (1) 


We introduce the weighted space L2(B",dv,) and its subspace, the weighted 
Bergman space A? = Aj ( B”), which consists of all functions analytic in B”. The 
(orthogonal) Bergman projection B, of L2(B", dv,) onto Att ") is given by 


g(f) dv.) 


i (1 =7Z: Cyrt ati . 


(Big)(2) = i: 


Finally, given a function a(z) € Loo(B"), the Toeplitz operator T, with symbol a 
acts on A; (B") as follows 


Ta:  € A} (B") +> By (ag) € AZ (B"). 


Quasi-radial and Pseudo-homogeneous Symbols 


Let k = (kj,...,k,) be a tuple of positive integers whose sum is equal to n: ky + 
... + kn = n. The length of such a tuple may obviously vary from 1, for k = (n), 
ton, fork = (1,..., 1). 
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Throughout the paper we fix a tuple k = (k,...,km), with k} < ky <...< km, 
and rearrange the n coordinates of z € B” in m groups, each one of which has k;,, 


j=1,...,m, entries. We will use the notation 
Za) = Bars + +s Ze )s 22) = Base + +s Zako)s eos Sm) = nts +++ Zn) 
with 
Z11 = 21, 21,2 = 22, +++ Zhky = Zk 
22,1 = Zkytds ees 22jky = Zk thy +> (2) 


Note that the above ordering condition on the tuple k can be easily fulfilled for an 
arbitrary (non ordered tuple) by making the biholomorphism of the unit ball that 
interchanges the coordinates of z. 

We will also use an alternative representation of a point z = (Z1,..., Zn) € 


Z= (2, .- +5 Zim), where z,) € BY, j=1,...,m. 


In general, given any n-tuple u, we will also use two its alternative representations 


u= (uy, eee Un) = (uq), re Uy): 
where 
Uji = Uj, U2 = U2, ..-, Uk, = Uk, 
U2,1 = Uti, ++) U2ky = Ubythys -+ +> Unjkn = Un- (3) 


We represent then each coordinate of z € B” (which is the same as each 


coordinate of zi, j = 1,...,m) in the form 
wz=lalt or ye= lyelfe, 
where #; and f;¢ belong to T = S'. For each portion zi, j = 1,...,m, of a point z 


we introduce its “common” radius 


r= zj,1|? Se ee IZ.4l?5 


and represent the coordinates of z;) in the form 


_ = = KV kil A phi 
Ze =r Setje, where £=1,...,kj, sq = (Sj1,---, S84) ESP t= ST ORG. 


Recall [9] that a bounded measurable function a = a(z), z € B", is called k-quasi- 
radial if it depends only on, ..., Mn. 
The following result is Lemma 3.1 of [9]. 
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Lemma 3.1 Given a bounded measurable k-quasi-radial function a = 
a(r},.--,1m), we have 


Taz = Yaka (a) a, ae Z, 
with 


Ya.k,A (a) = Ya,k.r(le(1)|, sees |or(m)|) 


of T = al/+ta+1 
= ( | bs ) / a(ri,.-.5 Tm) A — [r|2)* 


ayy |-+2kj—1 
x a ol 2k; dr; 
j 
j=l 


_ Tint jal +441 
Tr” + 1) T]jz 1 V(k; + |r) |) Am 


a(J/ri,.--, Tm) 
x Ty lagi “t = (47% ..c mde hy, 


where Am = {((11,---5%m) € RE: mt... + 1m € [0, 1)}. 


We introduce now an extension of the class of quasi-homogeneous functions 
defined in [9]. Note that here we use a slightly different notation for multi-indices, 
instead of a pair of the orthogonal multi-indices p and q of [9], we will consider just 
one multi-index p = (p,...,P,) that satisfies the condition |p| = 0. 

A function w is called pseudo-homogeneous (or k-pseudo-homogeneous) if it has 
the form 


m 


UW) = (sq, --- Sim)? = B(say,-- seo) | L4G) > 


where b(s(1),.-.,5(m)) € it xX... X s’) and p = (pj,..-,Pn) 
(Pa); a Pim) EZ". 

Most frequently we will consider the case when b(s(1),..., S(m)) = T]jz1 bj (sq) 
with bj € Loo(S'), for all j = 1,...,m 

Note that quasi-homogeneous functions, introduced in [9], correspond to the case 
when each b; = b,(sqj) has the form 


sik i IPi.kj| 
b; ri (Si) = = pie Si kj ; 
so that 


m 


Piajl_P. 
v2) = Is Sr Sia a (4) 
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Consider now the following k-quasi-radial pseudo-homogeneous symbol 


m m 


(2) =alri,....%m) | [b(sp) ? = a(ri,.... tm) [ [ ailsm) 


j=l j=l 


where a = a(ri,...,m) € Loo(t(B")), bj = bj(sqy) € Loo(St'), j = 1,-..,m 


and t = (t1,..-,tn) = (t(1),---5t@n)) E T”, p = (P1,---5Pn) = (Pa), -+++P(m)) E 
Zz". 
In what follows we will use the parametrization of each se a by its first 


kj — 1 coordinates s;,,...,5;4,-1, 80 that 5; 4, = ji- (s5 +. + Siu -1)s and 
kj-1 
bj = bj(sq) = bj(sj,. ane + Si 1) © Foo (By. 


Euclidean volume element dS; on S a , used in the next lemma, is given by 


). In this parametrization the standard 


dsj, ne S}j.45-1 
a — (57 + vos S41) 


Lemma 3.2. The Toeplitz operator T, with the above symbol (z) acts on monomi- 
als 2°, a € Z"_, as follows 


dS; = 


’ 


act 0, if di such that a; + p; < 0 
i — 
Vorprazt?, if Vi aj +p;>0 


with 


Tant+la+pl|+A+1 
PA +1) TF, Pda@l + slpal + &) 


Vokpa (a) = 


xf UI Tio Jig) elt POET) (1 — (ry tot tt dri dn 
Am 


m 


e Il Pla + Slpwl + &) bist?) Ths jet Spy 
kj 
j=l Tea Pe + Pie + 1) 444-1 


: 1,. 
x el = (sj 2 reer Sjukp—1)) 38 PIs dsj ec dS} K-15 


where b(si)”) = = b(Jsj1,---, JSS) 


Proof Given two multi-indices a, B € Z"., we calculate 


(Tyz", 2 [ane 


Tin+A+4+1) en, fae ie, ? 
= [ [4 T | tbc] — P+? dy 
T+ 1) Joe A scale ” 
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m 


_Ta+at Ta+at+) / 1 2a 
~ by] | lel tPit8 a = |r?) dlzli---dlzin 
aT (A + 1) + 1) (B") all T 


7 Tf ei Bitpi dt, 
it, * 


The last n integrals are different from zero if and only if 8 = a + p, in the last case 
each of them is equal to 277. Thus, setting B = a + p, we have 


m 
Sas ane alee 


jV(n tat) | a gees 2A 
So bj | | let = |r]?)* dlzli---dlzln. 
~TA+) Srey I] ay 


Changing the variables rj.¢ = rjsj,¢ (with d|z|j1-+-d|zlj.4, = ri" dr)dS;), we obtain 
Taa+AaAtl) 


aa See et ari, a Tm) 
TO + 1) 7(B")x] [7 Ae 


Gees 


2 horgy | + 1pqy|+2k)— re 2a ¢+pje+l 
tO Eee lie One Loco [Ts FEE ads; 


j=l j=l 
Tan+Aa+t+1) 


eae “7 elt sb@lth 
=) ———___ a(J/T]1,--+5ST in r, 
TA +1) An (Jr ) I J 


m kj 
2eje-+pje+l 
x I-(it+...+ Fay)" dr, +++ dry I] ie bj(sq) I] Cd PHT" dS 
jar 254 (=1 


Ta@t+ Ati) “T lepl+dlowl+é—1 
acres ae [a Viiv vim) Pg 
j=l 
aE te ds lh 
x (= (ri +... + tm) dry +++ drm T2"" : bio Te as; 
j=l Sy (=1 
T@t+ Ati) “I laltsipl-+kj—l 
See: [a Vries im) TT] 1; 
j=l 


x (L— (ry 4... + tm) dry ++ dtm 
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m kj- 
x ans sq) ne .o 
f=1 


2 2 Oj. FSD 
x Liye Six) MET ING AS; + OS) i —1 


_Tam+ati) 


“ lawl+3lp@l+h-1 
a(/T1,..+54/Tn r, 
TA+1) Jy, ( I I 


x d-(7 + ne as 


m 


x If nT] 


wd dos: 
x (l= (sj +... + ee dsj.1 +++ ASjey—-1, 


eae 


where bee = bU/SiAy +++) JSD: 
For 6B = a + p, by (1), we have 


(a+ p)!T(n+A+4 1) 


atp ~a+tp, _ 
gO yz = 
( Tan+lo+pl+A+) 
z Tint+A+4+1) 
= T(@a+p;+1 : 
I] Trestle eo( sit 
That is 
Te 0, if di such that a; + p; < 0 
—— ; 
* Voo.k,p, (a) ere, ifWia; +p; >0 
where 
= Tat+leo+pl|+At+l 
Tpkpal@) = : [oii vi 
PA + 1) [Jai del + zpw@l t+ &) Jan 


vrs wale lipathoe 
xT] alt zp@l+k "(Lh = (yc RAY Gh de 


m 


, Il P(lag| + slpwl + &) bist/2) rst ajet Spi 
k; 
j=l ae Pap + Pie + 1) a 


foe Ml a 
x ad = (Sj. eee HF Sj,kjp—1) Ha Pi dsj . ** AS; 4-1. 


For the case when |pqy| = 0, for each j = 1,...,m, Lemma 3.2 yields 
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Corollary 3.3 The Toeplitz operator T, with symbol 


m 


gz) =alri,..-.tm) | [b(s@) 4 


j=l 
where |pij)| = 0 for each j = 1,...,m, acts on monomials z*, a € Z"_, as follows 


7 if 4 £ such that ay + pe < 0 
a — 
* Foepalayzt?, if VL oy + py > 0 


’ 


with 


Tint lal +A+1) 
TA + DIG Taal + &) 


Vokp.ad (a) = 


x / a(J/ri,.- +, Tm) ao (a(n ete WV ince 
m j=l 


” P(lag| +k) ; 1/2 oe t+ Aju 
xTT= bjs?) T] stat 
fl Rca: 


jai Lee Giz + pie + 1) 


gol aes 
0 = (gg et gg) dy ais 


where b(s;', y= = DSS, ++ > Si) 


Observe now that for g(z) = a(r,.. -pTm) we have that Yakp.a(@) = Yakpa(@) 
(see Lemma 3.1); and for g(z) = bj(sqj) Po , |P@| = and pw = 0 for all / F j, we 
have that 

Touts 02 = Myton (Oe? 
where 
Pla] +) Ty aaiets 
ey ;) j 1/2 ALT ZPIL 
Fisog (a) = EY _ fas T] se 
Tex1 TD(qje + pie + 1) 44% l=1 
og Sp dy. 
x ad = (sj1 SF ioe Seer a dsj sie dS; K;-1- 

The last formula implies, in particular, that the action of the operator ey Pa@ does 


wt ) 
not depend on the weight parameter 1. 


We mention as well that if k; = 1, then |pi| = 0 implies that py) = (0) and that 
bj(sqj) = const. That is, the corresponding part in a pseudo-homogeneous symbol 
can be just omitted. 
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Corollary 3.4 The Toeplitz operator Ty with symbol 


m 


gz) =alri,..-.tm) | [b(s@) 4 


j=l 


where |pij)| = 0 for each j = 1,...,m, acts on monomials z*, a € Z"_, as follows 


if 4 £ such that ay + pe < 0 


Tez = 4) ; ; 
Vokpa(a)zt?, if WV £ a + pe > 0 
with 
m 
Vo.kpa (a) = Vakpa (a) I] Vo,.kvg (a). 
j=l 
Thus the Toeplitz operators T,, T. og, J = 1,...,m, pairwise commute and 


ils) i 


ny) = Ty ng rhs 
ie bjt bj iy 


Note that for the quasi-homogeneous symbol (4), considered in [9], with |p| = 
0, for each j = 1,...,m, we have 


Vwkpa (a) = I] 


_ T(laq| + kj) [1 ae (pjet+lpje\) 
k; 
jet [Tear Pe + Pye + VD) 2 q-1 pay 


x (L= (sj te + 5g) OTD dg, 1 dsj 4-1 


a kj-l 
7 Il P(jaq| + kj) 7 V(@je + 5 ie + Ipjel) + YD 
T 


jar P(o@l + allpall + i) ey Pa@je+ pet YD 


which, after returning to the notation of [9], recovers formula (4.1) of [9] witha = 1. 


Commutative Algebras 


The results of the previous section, and especially Corollary 3.4, permit us to 
describe many new commutative Banach algebras generated by Toeplitz operators. 
To characterize them we proceed as follows. 
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We start again with a tuple kK = (k,,ko,...,km) of positive integers with kj + 
ko +... +kn = nand ky < ky < ... < ky». Consider then the set Loo (t(B”)) of all 
k-quasi-radial symbols a = a(r,...,1m), and denote by 7, (k-qr) the C*-algebra 
generated by all Toeplitz operators T, with k-quasi-radial symbols a € Loo (t(B”)). 

We fix then a tuple b = (bj, b2,..,bm) of functions b; = bj(sq)) € Lis) 
(or bj = Dj(sj,,---»5j;4-1) € Lae’); after the parametrization of — and a 
tuple p = (pi, p2,---,Pn) € Z" with the property |p| = 0, for all j = 1,...,m, 
and denote by 7 (k, b, p) the unital Banach algebra generated by Toeplitz operators 
T 20> for j = 1, -2e, MN, 


Note that, contrary to the case of the algebra 7) (k-qr), the action of generators 
T,_ po and thus the properties of the algebra T (k, b, p) do not depend on the weight 
i") 


parameter 1. 

Then, by Corollary 3.4, the Banach algebra generated by elements of 7; (k-qr) 
and elements of 7 (k, b, p) is commutative. We group all the ingredients that define 
this commutative algebra into one set d = {k, Loo(t(B”)), b, p} and denote this 
algebra by 7; (d). That is the Banach algebra 7) (d) is generated by the elements of 
the algebras 7, (k-gr) and T (k, b, p). 

In the rest of the section we will list some common properties of the algebras 
7T,(d); for the specific case of [9], see [3]. 

For each K = (k,...,&m) € Zl! we introduce the finite dimensional subspace 
H, of the Bergman space A; (B"): 


H, := span {ey lag | = «. j= dyer) 


We have that 


AB") = @ H,. 


|k|=0 


and that the orthogonal projections P, of Aj ( B”) onto H, belong [3, Corollary 3.3] 
to the algebra 7) (k-qr). 

Observe that each space H, is invariant for all operators from 7)(d), but the 
action of the generators of 7,(d) on the space H, is quite different. The Toeplitz 
operator 7, with k-quasi-radial symbol a acts on H, as the multiplication operator 
Vak.a(K1,---;Km)1, while each Toeplitz operator t, po ,j =1,...,m, acts on H, as 


a certain weighted shift operator. Moreover each operator T, pq, restricted to H,, is 
Jj) 


T. »% 
( biK 


where [x] denotes the integer part of x > 0. 


nilpotent 


= 0, (5) 


2K; 
) [ Togyl |= 


Hk 
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We remark next that the algebra 7, (d) is not semi-simple, it has a sufficiently 
large radical. The next lemma describes some of its elements. 


Lemma 4.1 The radical Rad 7, (d) of the algebra TJ, (d) contains the following 

operators 

— DyT, po, where the eigenvalue sequence y = {y(kK) cen, of the diagonal 
mt)) 


operator Dy € T,(k-qr) depends only on the component k; of K, i.e. y(K) = 
y (kj) and {y(kj)}cez4 © Cos 


hy 


-— D, T]jz1 ee 79 ie where hi € Z4 and the eigenvalue sequence y = 
{yea of the diagonal operator D,, € T,(k-qr) is such that y(«) — 0 when 
|x| — 00 under the condition that kj, —> 00 for at list one jo € {1,2,...,m} 
with hj, > 1; 


hy 
— Dy, T]jzi (z,,-9) ,h; € Z, with at list one jy € {1,2,...,m} with hj, > 0 and 
yY € Co. 
Proof For a special case of quasi-homogeneous symbols of [9] the result has been 
proved in [4, Lemma 6.7], and that proof almost literally extends to the above 
general case of pseudo-homogeneous symbols. Oo 


We note that the linear span of the operators considered in the last two items of 
Lemma 4.1, i.e., the set of all operators of the form 


a4 m hye 
atl (z,, re) (©) 
f=1 


j) 
j=l G 


where D,, € 7 (k-gr) and hye € Z4, for£ = 1,...,g andj = 1,...,m, forms 
a dense subalgebra D,(d) in the algebra 7,(d). At the same time, as in [4], the 
representation of operators from D,(d) in the form (6) is not unique. The next 
lemma explains the source of such an ambiguity. 


Lemma 4.2 Let 


fi 
f=1 j=l na 
where all tuples he = (hy ¢,..., Mme) are different, then the following statements 
are equivalent. 
Gi) A=0. 


hye 
(ii) For each € = 1,...,g we have that Dy, T]jzi @ ) = 0. 
JEG) 


(iii) For each€ = 1,...,g we have that y¢(|aay|,---.|& in|) = 0 if lag| = parol 
forallj =1,...,m. 
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Proof Follows the same arguments as the proof of Lemma 2.8 in [4]. 

The part (ii) = (i) is trivial. 

The part (iii) = (ii) follows from (5) and the conditions posed in (ili). 

To prove (i) => (iii), for a fixed a € Z"_, we introduce the set of polynomials 


m hye 
Fy = fa =11 (7,49) Ca : £=1,..,g, fia #9 
j= 


This set is either empty or consists of orthogonal elements of the space Ar B"). 


Observe now that among multi-indices a € Z"., with |agy| = ual for all j = 
1,...,m, there exists such @ that the set Fy is not empty. Then the orthogonality of 
elements of F, for that aw together with Ae, = 0 imply 


Dy fea = y(laq)|,---. lan few = 9, forall £=1,...,g, 
Vb ) (m) 
or y(|@qy|,---. |@@n)|) = 0 for all € = 1,..., 8. Oo 
Yet Another Option 
We start again with a fixed tuple on natural numbers k = (kj,...,km) with 
ky +...+k, = n, and again rearrange the n coordinates of z € B” in m groups, 
each one of which has k;, j = 1,...,m, entries. We keep using the notation 

Zay = (Zayas. ++ Ze)» 22) = (ats Zaks ves Zon) = Binds ++ Zink )s 


so that we have two different representations of z € B”: 


Z=(Z1,---52n) and Z= (a1),...,Zm))- 


In general, having any n-tuple x = (x1, ...,%,) and the above fixed tuple k, we write 


x= (xq), apa ,X(m))- 
Introduce now the indicator sets 


KW = (6 € {1,2,....n} 5 ze E zh, jJ=1,...,m, 


which specify the places of elements of zj) in the n-tuple z = (z,..., Zn). 
As in the previous section we consider a tuple p = (pj,...,Pn) € Z” such that 
each its portion pq, j = 1,...,m satisfies the condition |pj)| = 0. We introduce as 


well n-tuples pi), j = 1,...,m, by 


Pe, if Ley 


PHL = 
0, otherwise 
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We represent now each coordinate of z € B” in the form 


Ze = rete, where re =|ze| and tf € T; 
then we represent each rg as re = rse, where r = et +eeet r and s = 
(51,---,8n) € Se 
Recall that n-tuples t = (f|,...,t,) € T” and s = (s1,...,5n) € st admit the 


alternative representations 


t= (t(1), ie tun) and s= (sq), AS wy S(m)): 


Remark 5.1 The principal difference of this section compared with the previous one 
is that, contrary to the previous case of s = (s(1),.-..S(m)) € se Kona X Raia 
we have now “just one big sphere” s = (s,,...,5,) € Sook 


Introduce the symbol ¢ = bj(sij)) fin > where b; = bj(sqj)) is bounded and 
measurable, and the portion pq of a tuple p = (pj,...,Pn) € Z%. satisfies the 
condition |py)| = 0. Given two multi-indices w, B € Z", we calculate 


(Tyj2%, 2?) = (Wy zt) 


eee ” & pies, 
= b; | | wet Bee — }7-|2)4 72-P+P0 Gy 
mT +1) Jan . ene ° 


Tant+at Ta+atl) etBe+l WI fi ae Betpi 
= i 1 —|r|*)* dry +++ dry Oat 
w"T(A +1) 1) baa ay ( | |? ) 1° I ite * 


The last n integrals are different from zero if and only if 8 = a + pi), in the last 
case each of them is equal to 27. Thus, setting B = a + pi), we have 


(Ty,2%, 270) = (Vv; 2%, gat Pw) 


Tat+at+) + Xr + 1) eer tPoet ly 57 
as d seed n 
“TOL I. oT = area 


Tetaey fp We oct de. 
=?" (n a AS ) a gee eer ky, = r|?)* ar | bj IIs eTPOLTT I¢ 
PA+1) Jo Sia eee 


eee aut Pinetl 
= 72" | oes Se Rohe plelta— ‘a — ry arf TT LPG) 
FO+a) e 


51 Tiaa+A4+ D(a] +n) JT] ecto as 
Tantlol|+rti) Jeo TAA 


Toeplitz Operators with Quasi-radial and Pseudo-homogeneous Symbols 415 


We parametrize then Ss by n — 1 elements of s = (s1,...,5,) selecting them so 
that the remaining one does not belong to s,j). To simplify the notation in further 
calculations we assume that s, ¢ sq (qn = O in this case), and we parametrize 
thus S"~! by sy, ..., Sn—1. That is, 


Ta+At+ (jal +n) 
Tat lel tat) 


n—-1 
2a¢ +P(j).e +1 
_, Pio) [Is 


t=1 


Cus 


x (l-(sp +... + ron) ie ds, +++d5y—1 


_ Ta+A+ F(a] +n) by(st)?) Ts a+ 4Di,0 
Taat+lal+At+ 1) Ja, 


x (1 = (5; +... + Sp-1))™" ds1 +++ ds] 


Changing the variables: se = ug, for £ € xj, and sy = (1 —(ujit...+ U(j),k)) Ug, 
for g ¢ x; U {n}, we have 


Ta+A4+)D (jal +n) 
Tatlol+ath 


(Tye Zo tPO) = 


(ut) ll“ act spe 


Kj ley; 
—|laq|tn—kh-1 
x (1 — (uy Pi eh Hiya)" lawl ae I] dug 
ley; 
an 
xf TL t= ow) TD a 
An g¢ yjUKn} at jUKn} at xin} 


_Patatyrdaltny Magy Pa + 
Tint lal +a+1) Tel—lew|+nr—&) 


x / b; (uc 2 []« net ape (1 = (uq),1 Sins Se Pree) Re a I] dug. 
Ag 


lex; LEX; 


Taking into account that 


Tint+aAt]) Hee,, Tae + pe + 1) Tog; Ta, + 1 
T(n+ ja} +441) 


(2° +P(j) a +P(j) ) - 


we come to he following lemma (where we change uy for sz). 


Lemma 5.2 Let b; = bj(sq) € Loo (Be ') and let the portion pq of a tuple p = 
(P1,---.Pn) € Zi satisfies the condition |pq| = 0. Then the Toeplitz operator 
T, po acts on monomials z* as follows 

0) 
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0, if ¢ € x; such that a + pe < 0 
Tmo =y~ se 
oe Vf apserP, if VEE xj ae + pe > 0 
where 
T'(\a| + 7) 


Y. rg a)= 
Yo? aoa! ) TMeey, Poe + pe + DE (le| — law] +n—-&) 


/ i: (545) I act spe (1 _ (sq, ee ge I] ds 
Ak 


ci leyj ley; 


and bis) = = DCS 5).1, +++ S57). 
Corollary 5.3. The action of the operator T, p@ does not depend on the weight 
*() 


parameter i. 


Corollary 5.4 Let a be a bounded measurable k-quasi-radial function, let bj = 
bj(sq) € Loo (B') for each j = 1,...,m, and let the tuple p = (p1,...,Pn) € ZY 
satisfy the condition |piy| = 0 for each j = 1,...,m. Then the Toeplitz operators 
Ta, T, Po» for j = 1,...,m, pairwise commute. 

0) 


Remark 5.5 Itis oe to check that, contrary to the case of Corollary 3.4, 
we have that neither 7, Ty if = T mb? nor i, pot, ro = — i fo sj TOs for all 
tO) 


j # £ both from 1,. 


Now we can introduce new commutative Banach algebras which, in the setting of 
this section, are defined by the following data. We start with a tuple k = (ki, ..., Km) 
of positive integers with kj + ... + k, = n. Consider the set L..(t(B”)) of 
all k-quasi-radial symbols a, then fix a tuple b = (b,,...,b,) of functions 
bj = bj(s) € Loo(BY), j = 1,...,m, and a tuple p = (p1,...,pn) € Z" with 
IpG)| = 0 for all j = 1,...,m. We group all the above ingredients into a single set 
0 = {k, Loo(t(B”)), b, p} and denote by 7, (0) the unital Banach algebra generated 
by all Toeplitz operators 


Tz, with aéLo(t(B”)), and ae ce forall j= 1,...,m 
ij) 


acting on the weighted Bergman space At ( B”). By Corollary 5.4 each algebra of 
this type is commutative. 

Not entering into details, we mention just that the algebras 7, (0) possess the 
same properties as the algebras 7) (d) of section “Commutative Algebras”. All of 
them have the same invariant subspaces H,, with k € Z4+, they are not semi- 
simple, and have the same type on an ambiguity of representations of elements from 
corresponding dense subalgebras. 
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Remark 5.6 Two different procedures of the construction of commutative Banach 
algebras 7,(d) and 7,(0) can be even combined into a mixed single one. That 


is, given a tuple kK = (k,...,km) with kj +... + ky, = n, we start as in 
section “Quasi-radial and Pseudo-homogeneous Symbols” by representing z € B” 
in the form z = (Z(1),...,Z(m)), and then we represent each coordinate z;¢ of zj in 


the form z;¢ = rjS;,et;,¢, where 


kj-l 
= Vizak Feet lay? He ET. 5G) = (Sjar-+ Siu) ESE 


After that we proceed with the recipe of section “Yet Another Option” on each 
kj-1_ ; : 
“small sphere” Sj, j = 1,...,m (see Remark 5.1), i.e., we separate each portion 


pq) of ap = (p1,.--,Pn) € Z” onto smaller sub-portions py,) with |pgn)| = 0 and 
use, as generators, the Toeplitz operators with symbols ¢;,, = 5) (s(n) tea (and 
the Toeplitz operators with corresponding quasi-radial symbols). 
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Abstract We give here an “automatic” proof of a weighted embedding theorem 
with a bumping of the weight. It implies a well-known weighted theorem of C. 
Pérez. 
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Introduction 


Preliminaries 


In this note we give a simple Bellman function proof of Carlos Pérez’s “bump 
theorem” for the two weight estimates of maximal operators. 

The original question about two weight estimates for the singular integral 
operators is to find a necessary and sufficient condition on the weights v and u such 
that a Calder6n—Zygmund operator T : L?(v) > L?(u), 1 < p < ov, is bounded, 
i.e. the inequality 


fitruas < c/ lf |\Pudx Vf € L?(u) (1) 


holds. By weight we understand here non-negative locally integrable function. 
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There is a famous solution of this problem when T is the Hilbert transform that 
belongs to Misha Cotlar and Cora Sadosky [1]. This characterization is in the spirit 
of Helson—Szegé characterization of the one weight boundedness of the Hilbert 
transform. Helson—Szeg6 characterization can be expressed e.g. as the existence 
of an H'-function (here H! is a Hardy class of analytic functions in the disc) that 
“closely follows” the weight u = v. When two weights are present Cotlar-Sadosky 
[1] found the condition equivalent to (1) (with T being the Hilbert transform) in 
terms of the existence of an H!-function that “closely follows” the right combi- 
nation of two weights. Parallel to Helson—Szeg6’s characterization, the equivalent 
condition of Hunt-Muckenhoupt-Wheeden was quite important for the one weight 
theory. And then Sawyer’s characterization for the two weight problem for maximal 
operator appeared. All this asked for the parallel Muckenhoupt—Hunt—Wheeden 
or Sawyer’s language for two weight problems for general Calderén—Zygmund 
operators. However, it is worthwhile to mention that even in the one weight situation 
one has two different characterization of the boundedness of the Hilbert transform: 
(1) the Helson—Szeg6’s characterization and (2) the Hunt-Muckenhoupt—Wheeden 
characterization. They are equivalent of course, but the direct analytic proof of their 
equivalence is still unknown. In the two weight situation we consider here this is 
even more so that one can express the answer in several different languages. Cotlar— 
Sadosky characterization is the extension of Helson—Szegé’s one. 

The problem of finding the two weight characterization for the boundedness of 
the Calder6n—Zygmund operators T in Sawyer’s language has been recently solved 
for p = 2 when T is the Hilbert transform by M. Lacey, thus culminating a long 
search for the two weight T1 theorem by the group that included I. Uriarto-Tuero, 
E. Sawyer, C.-Y. Shen and earlier efforts of F. Nazarov, S. Treil, A. Volberg. For 
short range dyadic singular operators it has been solved by F. Nazarov, S. Treil, A. 
Volberg in 2008, [12], see also [14]. Of course the story of two weight estimates 
for positive operators can be traced to works of E. Sawyer [19, 20], the latter paper 
is devoted to the characterization of the two weight boundedness of the maximal 
operator. 

We are working here with the maximal operator again. But we are interested in 
simple and sharp sufficient conditions rather than in necessary and sufficient ones. 
As Pérez has shown this can be done by the use of Hunt-Muckenhoupt—Wheeden 
language by modifying it with the introduction of Orlicz type norms. We wish to 
mention here that the Pérez “bumping” by Orlicz norms [15-17] has been replaced 
recently by the entropy norms “bumping”, see [21]. We describe below the Orlicz 
bumping that we will be using in this paper. 

For the interesting operators acting on functions defined on R” the following two 
weight analogue of the A, condition is necessary for the boundedness of the operator 
Mair TM, 1p". Below Q denotes any cube in R”. 


p/p’ 
sup (io [ ud) (ior [ oa) <0o (2) 
Q Q Q 
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or in the symmetric form 


1/p 1/p’ 
sup (10° [ war) (ia [ oar) <0o (3) 
Q Q va) 


Simple counterexamples show that this condition is not sufficient for the 
boundedness. This holds even for the simplest “singular” operator: the Hardy— 
Littlewood maximal operator, and even for its dyadic counterpart. So a natural way 
to get a sufficient condition is to replace the L' norms of u and o in (3) (or the 
LP and L”’ norms of u!/? and o!/?’) by some stronger Orlicz norms (this is called 
“bumping” the L’? norms). 

This ideology of bumping can be traced to the work of C. Fefferman [9]. It has 
been widely used by D. Cruz-Uribe, C. Pérez, J. M. Martell, [2—8, 15-18]. 

Namely, given a Young function ® (convex increasing function) and a cube Q 
one can consider the normalized on Q Orlicz space L®(Q) with the norm given by 


d 
lle i= int} SO: fo(&) or < i 


And it was conjectured (for p = 2) that if the Young functions ®; and ® satisfy 
the condition 


dx 
i Jee, (4) 
D(x) 
then the condition 
sup lll og) lps2y, < (5) 


implies that for any bounded Calder6én—Zygmund operator T the operator 
M,/2TM,1/2 is bounded in L?. 

This conjecture (belonging to C. Pérez and D. Cruz-Uribe) was, in fact, proved 
by different methods in the paper of Lerner [10, 11] and in the paper [13]. The 
first paper has the advantage of giving the result for all p’s. The second paper deals 
only with p = 2 (which is still quite an interesting case) and it demonstrates an 
“automatic” proof of the bump conjecture. Namely, a function is constructed (on 
a certain infinite dimensional space) such, that after feeding into this function the 
distribution functions of the weights and several other data, we obtain the desired 
estimate just by applying Green’s formula to this function (we call it a Bellman 
function of the problem). 

But the paper [13] did not deal with the simplest “singular” operator: the maximal 
operator. This was just because for maximal operator the right bump conjecture 
has been already proved by Carlos Pérez, see [15-17]. But it seems to us that to 
give an “automatic” proof of Pérez’s result can be interesting too. Moreover, the 
main Theorem 2.1 formally proves a seemingly slightly more general result, which 
implies the maximal estimate with a bumping proved by Pérez. 


422 A. Volberg 


So in the present note we are concerned with the simplest case, namely we deal 
with a new “automatic” proof of the sharp bump result proved by Pérez [15-17] for 
maximal operators (and a slight generalization of this result). The bump condition 
here is slightly different, it is a one-sided bump condition. 

Our result differs in a way from Pérez’ theorem in that it has stronger regularity 
assumptions on the bump. We assume that ® is convex, and Pérez assumed that 
®(t’) is convex. The comparison between (4) and Pérez assumption f Bott <o 


plus a simple change of variables show that ®(s) = B(,/s). This relationship 
between our “bump” function ® (applied directly to our weight) and Pérez’ “bump” 
function B (applied to the square of the weight) shows that our assumption is 
stronger, but one should remember that the main assumption is (4), and thus the 
interesting case is ®(t) = td(t), where d is a sort of logarithmic correction. If d is 
regular enough there is no difference between these two convexities. This is the case, 
for example, if sf tends monotonically to zero when s tends to infinity. There is 
one more regularity assumption that we choose to impose on ®. It is formulated 
in (28) in terms of W that will be defined by ®. It is again satisfied for all sufficiently 
regular d. 

The proof has some value because of the above mentioned slight generalization 
(Theorem 2.1) and because it illustrates how one can give an “automatic” proof by 
presenting a formula for a certain function, which, in its turn, “automatically stops 
the time” (so no stopping time argument should be invented). On the other hand, the 
accent in difficulty is moved now to building such a function. And the main thrust 
is to ensure its specific concavity properties. 


Sharp Bump Conditions for the Maximal Operators 
and Weighted Embedding 


Let us consider the maximal operator acting on function on R”,n = 1. This is just 
for the sake of simplicity (the argument can be carried on for all n). In what follows 
J denotes an interval of the real line R. We prove here Pérez’ theorem [15-17]: 


[ Mevoy ua < oa gy’ o dt, (6) 
under the condition 
sup) Io lreuy <C, (7) 


which ports the name one-sided bump condition. 
Notice that we made here a change of variable in comparison with (1) and (5). 
In fact, for p = 2 inequality (1) can be rewritten as the boundedness of the 
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operator u!/?Tv—!/? in unweighted L?. So when one writes that one is interested 


in two weight boundedness of u!/?7o!/?, one should identify v—! in (1) with o: this 
identification has been done while writing down (5). 

In this section again we use a convenient change of variable, transforming the 
two weighted inequality in the form (1) 


[ Morus c | fod, 
into 
[ioe ayuar = cf yuan, 
v v 


Now we change the variable o = v~!, g = fv and we obtain the form (6) of two 
weight inequality with which we will be working now. 

The reader should keep this change of variables in mind because the two weight 
problems used to be formulated in different forms: in the form (1), or in the form of 
the boundedness of u!/?To'/? in unweighted L’, or, at last, in the form of (6), where 
the integration is with respect to the same measure odx in the right hand side and 
inside the operator. Notice that all forms are basically equivalent, but the latter has 
the advantage that measure odx can be easily made a general measure (and not just 
Lebesgue measure density). 

Condition (7) above is a strengthening of the classical Ay condition. Some 
strengthening is unavoidable as we are dealing here with a two-weight problem. 
The sharp condition on ® (called B, condition) will be quoted below, in essence it 
means [°° sot < 00. 

We prove this result of Carlos Pérez by a new method, actually by a formula. 
What follows is a sort of automatic proof of the result. 

It is well known that the problem for the classical maximal operator can be 
reduced to dyadic maximal operator (in this setting it is easy and follows by a simple 
trick with averaging over random dyadic lattices). 

Let us denote by D a standard dyadic lattice. Then it is also quite known that 
we can reduce our problem to the following: If {a;|I|}:ep is a u-Carleson sequence, 
then 


1 


inl Y= ai(o)? || < C(o),. (8) 


IED ICJ 
The requirement that a;|/| is a u-Carleson sequence means that 


1 


i > aill| < Clu),. (9) 


IED ICI 
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Since we bump v, we should care about 


1 
Nj(t) = wl eJ: o(x) = BI. 
Also we denote 
1 
Ay = | rs a,|I\. 


TED ICI 


The u-Carleson property of {a; - |J|}rep means A; < C(u), for all dyadic J. 
The combination of (7) and the u-Carleson property of {a; - |J|}rep means that 


VJ €D we have Aj: ||o||;o(7) < C < oo. (10) 


We want to deduce (8) from (10) under the sharp assumptions on the gauge 
function ®. These assumptions are as follows: ® is increasing convex function 
satisfying (4). 

So here is our theorem. 


Theorem 2.1 Let the sequence of nonnegative numbers {az\rep satisfy (10) for 
every dyadic interval J, where Ay = WT Vrepicy Ull|. Then if ® is increasing 
convex function satisfying 


ae 
—~dt < 
| ames” 
and satisfying a mild regularity condition then for every dyadic interval J the 
following inequality holds 


— S~ arlo)?|I| < Clo),. 


1 
lI IED ICI 


Notice that in the theorem there is no second weight u, there is no assumption of 
the type (9) whatsoever. In the standard deduction of two weight maximal theorem 
from Theorem 2.1 of course the second weight u will be present and the standard 
linearization of maximal operator will bring the sequence of nonnegative numbers 
{ar}iep that will satisfy (9). Then the application of Theorem 2.1 will prove the 
maximal estimate (in its form (8), but we know by the Sawyer’s result [19] that this 
is enough). 

However, it is not clear how to get Theorem 2.1 from the corresponding maximal 
result. It seems like for that one would need to build some special weight u, which 
is not present in Theorem 2.1. This is why this theorem seems to be a slight 
generalization of the result of Pérez. 


Bumps for Maximal Operators 425 


The mild regularity condition on ® is formulated in terms of its transform W that 
we are introducing in the next section “Orlicz Norms and Distribution Functions”. 
In terms of W this mild regularity condition is formulated at the very end of the 
paper in (28). In fact we believe that for every increasing convex ® satisfying the 
integrability condition in the theorem, there is a smaller function also satisfying the 
integrability condition and such that the new W will satisfy this mild regularity (28). 

Notice that the mild regularity condition (28) is satisfied for all examples of ® 
presented at the end of the next section “Orlicz Norms and Distribution Functions”. 

We explained that this result gives the bump result of Carlos Pérez. The latter is 
the sharp result and the integrability of 1/® cannot be weakened. On the other hand, 
notice that the statement of the theorem requires only (10) and does not require (9). 
So it is more the statement of “Carleson embedding theorem’, than the statement 
about maximal operator. After all, it might be important to have simple conditions 
on the sequence of nonnegative numbers {a;};ep and a weights o that would imply 
inequality (8). Notice that by Sawyer’s arguments we know that (8) then implies a 
general embedding theorem localized to interval J: 


1 
Wl > algo)i|I| < C(g’o),. (11) 


IEDICI 


Orlicz Norms and Distribution Functions 


Here we repeat verbatim some results from [13]. Orlicz norm is not very convenient 
to work with, so we would like to replace it, and to work with something more 
tractable. 


A Lower Bound for the Orlicz Norm 


Let ® be a continuous non-negative increasing convex function such that ®(0) = 0 
and ee at < +00. Define Y(s) parametrically by Y(s) = ®'(t) when s = 
rT ACSAO (t > 0). Then W(s) is positive and decreasing for s > 0 and sW(s) is 


increasing. Moreover h aol < +00. Indeed, using our parameterization we can 


rewrite the last integral as 
i. 1 4 o’ (t) is 
O(t) (1)? 


The first integral converges by our assumption and the second integrand has a 
bounded near +00 antiderivative aoe 
Let w > 0onJ C R". Define the normalized distribution function N of w by 
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Nit) =N7() = 7 {x eI: w(x) > 7} | (12) 


Lemma 2.2 Let Y : (0,1] ~ R+ be a decreasing function such that the function 
5 ++ sW(s) is increasing. Let ® be a Young function and let 


1 


W(s) < CO'(1) where s= EYOKZ6) 


for all sufficiently large t. Then for N = Ny 


ny (N) = [ N(YINE) dt < Chl (13) 


L°()* 


Proof The left hand side scales like a norm under multiplication by constants, so it 
is enough to show that if ||wl|zaq) < 1. ie., 


1 . / 
a foo =f N()O(Mdt <1 


then ny, (NV) is bounded by a constant. Since sW(s) increases, we may have trouble 
only at +00 It is cleat that it suffices to estimate the integral over the set where 
W(N(t)) > ©'(2) but since W is decreasing this means that N(t) < C/(®(t)®'(a)), 
So we get at most eae ®(t)~!dt and we are done. Oo 


Remark In the above Lemma 2.2 we do not need the assumption that 


1 


But in what follows this assumption will be needed, and the reasoning in the begin- 
ning of this section shows that for any Young function © satisfying [ *(@(t))!dt < 
oo we can find W from Lemma 2.2 satisfying (14). 
Examples 
In the above section only the behavior of ® at +00 and the behavior of V near 0 
were important, so we will concentrate our attention there. 

Let ®(t) = t(Int)*, a > 1 near oo. Then 

®'(t) ~ (Int)”, @(t)®' (rt) ~ t(In st)”, 


so W(s) := (In(1/s))® satisfies the assumptions of Lemma 2.2: to see that we notice 


In(®(t) 0’ (A) ~ Int. 
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If O(¢) = tin¢(inins)*, w > 1, then 
®'(t) ~ Int(InInt)®, ®(t)®'(t) ~ t(Int)? (In Int)” 
and W(s) = In(1/s)(In In(1/s))* works because again In(®(t) ®’(t)) ~ Inv. 
Note that in both examples f,(sW(s))~!ds < oo. 


The examples of Young functions with higher order logarithms are treated 
similarly. 


Bellman Function of a Problem 


Let us consider a function D(A, t, N) of three variables, where the last variable N is 
in fact any decreasing function on [0, oo) taking values in [0, 1]) and such that 


D(A,t,N) <C-N (15) 

di yD <0 (16) 
[o-e) 2 

i 2 as NOS K( [ neo«r) ; (17) 
9 OA 


where what is K will be determined later. The middle inequality means that for every 
t the function of A, N is concave. 

We will be looking for D of the following form D(A, t,N) = B(At, N) (scaling 
in (8) hints us to do that) , and we put for an arbitrary dyadic interval J €¢ D 


BU) = f BAr-t.Ny(o)ade 


What is a Priori Bounded? 


In this section we are going to discuss the following question. If (10) is satisfied, 
that is (after normalization) ||o||;*(yA7 < 1, then what quantities, involving ® and 
W from section “Orlicz Norms and Distribution Functions’, are bounded? 

First of all, we know that 


/ NA(A)U(N:()at < ello lleouy 
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and so our first inequality is 


a / Nj(t)W(Nj(t))dt <c. (18) 


What else? Well, to calculate the norm of o, we should integrate something like 
N7(t)®'(t), but not exactly! Let us be careful, and it will be the thing that we (at 
least I) was missing all this time. 

We know that ||o||;o¢) < a We also know that 


olen = inf {A (® (5)), < 1}. 


Therefore (the average in the inf decreases when A increases), we get 
(® (A;o)), < 1. 
We now write the last inequality as follows 


7 t 
®'(t)N, | — ) dt < 1, 
[ vo (+) < 


or 
Co 
arf N,(t)®’ (A;t)dt <i. (19) 
0 
This is the second inequality. Notice that we have ®’(A;t) instead of ©'(r), and 


this will be crucial. 


Next Step is Green’s Formula on the Tree 
and the Use of Concavity 


So, 
BU) = i BUA, - 1, Ny(0))dt, 


we can write 


and using that V(t) = es 
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B(I4) + BL) 
7) 


BU) = f [btar-t.myo)—8 (ASE 


“t, mic) |e 


[[(* +A; gle Ore (t) 
2 2 


) 5 (Ba -t,.Nr+(t) + B(AT- - 1, w-) | : 


In the second line we use concavity of B, thus, this term : ae a In the 
first line we use the mean value ee and the fact that as a <0 A eae! in 
variable A), we are ensured that $4 2 at the intermediate point is at least #8 SB (Art, N,(t)). 


Then we can continue (using oat A; — Arp Ale = az) 


={ 
2 
B()- BU) + BIL) ser Des >a f Fear, N,(t))dt > a(o a); (/ aia) 
vA Tt, LNT 


The last inequality is just Hélder inequality applied to 


2 
ia N(t 
oy = ( / ro [ ) fs + (Art N,(t))dt 
0 yt 128 (Art, N7(t)) 
We want 
N,(t)* 
/ a_i (20) 
t 54 (Ayt, Ni(t)) 
which will be satisfied if we take (18), (19) in combination with 
Nr (t 1 
(0 o 


oe Ayt, Ni(t 
40 MO) 2 Ga) + Or) Ar 
After dividing by t becomes 


Ni(t) A 
oF Aut NO) = GM) + OGr) Apt 


So it is tempting to put 


N 1 
W(N) + OA) A’ 


OB 
—(A,N) = 
5 AN) 


In fact this formula cannot give us the desired B. But in section “Here is B” we will 
modify the formula. 
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Here is B 


Recall the notations: we start with convex ®, whose reciprocal is integrable at oo, 
and we build W(s), (s) = sW(s), 5 = 0, in such a way that 


1 1 
WV(s) = —., ifs = ——_., 22 
oP) = Sa? 1 = Gow CA 
which implies that 
W(s) = O'(2), if : (23) 
s)= , if s = ——.. 
! (t) B(t) 
We want to combine that with 
! ! 1 : 1 
O'(t) < ® | —), if s= ——_., (24) 
KY ®'(t) B(t) 
Together (23) and (24) give that 
! 1 : 
W(s) < D[ -],ifsxO. (25) 
Ss 


which we will use in what follows. 
But we need to check first (24), which is the same (as ®’ is increasing) as to 
verify that 


ts 


1, 1 
= (26) 
S 


ee BNO 


This is immediate: 1 = O'(t)P(t) > t just because B(t) > 1, O'(t) > 1 if ¢ is 
sufficiently large (this is a simple consequence of convexity of ® and condition (4)). 
Let us introduce now 


W(s), 5 € (0, 1], 
W(4).s>1. 


Ss 


Wrew (s) = 


Recall that we are looking for D(A, t, N) = B(At, N) with properties listed at the 
beginning of the Section. Here is B that will give us the function D we need: 


1 ds 


¥ 
BN) = N f aie ae 
( 0 Wrew(s) s 
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Obviously 

0<B<CN 
because TIO is = ~ integrable on (0, 1], and m0 ae 5 is ~ integrable on [1, 00). 

The Seterathant of Hessian matrix is zero, so concavity is fe to B,, < 0. 

But 

N we 

Be, = =) (1+5 2D) <0 
uC) N W(x) 


when t/N < 1 because sW(s) is increasing on [0, 1]. In the other case N/t < 1 we 


have B(t,N) := N fx "TG . as 


ee (=tt%))z 
wo p(®) t w(X) ) ~ 


because W(s) is decreasing on (0, 1]. 
We are left to prove that (20) holds, namely, that 


oo N*(t) 
i, iBI(At, N@) (rN) dt<C. (27) 


This is the same as 


si= A [NOY (7) dt<Co. 


We split integral S to three parts. Integral S,, where Oe < 1, so At < N(t) < 1 (we 
use that W is decreasing on (0, 1]): 


i 1 
Si <a N(t)V (=) dt < <a | wan dr = f W(s)ds = C; < oo. 


In fact, using, for example, (24) we see that [, U(s)ds < C [™ SO dt we f° oe dt. 
The latter integral is finite because in all interesting cases we can assume P(t) < 
t°/, In fact, for power bump functions © the result we are proving can be proved in 
a much easier way, see for example [9], (and, in fact, it follows of course from the 
result for more complicated bumps). 


Integral Sy is where _ > 1,At < 1. We use that W is decreasing on (0, 1]. 


S> =a [now (=) dt < af N(t)W(N(t)) dt < Co, 


which we know from section “What is a Priori Bounded?’’. 
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Finally we are left with integral 53, where At > 1. 


safe vow (* ) at. 


Here we use the following property of Y, which is satisfied for all reasonable 
W obtained from © (it is a small restriction on regularity of ®, but notice that all 
interesting W’s are (sub)-logarithmic and so have this property): 


Vso, 82 € (0, 1], Y(s1s2) < C(U(s1) + V(s2)). (28) 


Using (28) we continue 


h< caf N(t)Y(N(1)) dt + ca fo N(H)w (5 ) dt. 


The first integral is < CC3 by (18) of section “What is a Priori Bounded?”. 
For the second integral we use (25) and (19): 


af vow (<. Jaca fe N(t)®’(Ad) dt, 


which is again bounded by C4 by (19) of section “What is a Priori Bounded?” 
We are done. 
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The Necessity of A,, for Translation and Scale 
Invariant Almost-Orthogonality 


Michael Wilson 


Abstract If v is a measure, we say a set {Wx}, C L?(v) is almost-orthogonal in 
L?(v) if there is an R < oo such that, for all finite linear sums )> Agr, 


/ > evn) dv < RY Axl’. 


If z = (t,y) € R4t! = R¢ x (0,00) and f : R4 > C, define f(x) = f((x —1)/y). 
If Q C R¢ is a cube with sidelength €(Q), define T(Q) = Q x [¢(Q)/2, £(Q)). 
We say that {¢;}/, a finite set of bounded, complex-valued functions with supports 
contained in B(0; 1), satisfies the collective non-degeneracy condition (CNDC) if 
there is no ray emanating from the origin on which the Fourier transform of every ¢, 
vanishes identically. We prove: If jz is a doubling measure on R¢ with the property 
that, for some family {¢;}/ satisfying CNDC, it is the case that, forevery 1<k<n 
and every choice of points €(Q) € T(Q), Q € D (where D is the family of dyadic 
cubes), the set 


(Px)eo) 
vL(Q) 


is almost-orthogonal in L*(j1), then pz is a Muckenhoupt Ago measure. 


QED 
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Introduction 


We recall that a non-trivial Radon measure v on R¢ is said to be Ago (in symbols: 
v € Ago) if, for every « > 0, there is ad > 0 such that, for every cube 0 C Ré 
and every measurable E C Q, having |E|/|Q| < 6 implies v(E) < €v(Q); where, 
here and in the future, we use | - | to denote a set’s Lebesgue measure. A non-trivial 
Radon measure v on R¢ is said to be doubling if there is a finite C so that, for all 
cubes Q C R¢, v(2Q) < Cv(Q), where 20 denotes Q’s concentric double. It is easy 
to see that v € Ago implies that v is doubling; it is not so easy (but classical) that 
the converse fails. If v € Ago then dv = v dx for some non-negative v € Lh,.(R4 ). 
In such a case we say that v € Ago. It is well known that v € Ago if and only if there 
isap > | anda finite K, such that, for all cubes Q, 


1 ; 1/p K, 
= d. < — dx, 1 
(al ") sg fee ” 


which is the so-called “reverse-H6lder inequality”. 

In a recent paper [9] the author proved that, if 44 € Aoo, then, in a precise sense 
to be explained shortly, L7(j1) and ordinary, Lebesgue-measure L? have the same 
almost-orthogonal systems; where we say that a collection of functions {yx}, is 
almost-orthogonal in L?(v) if there is a finite R so that, for all finite linear sums 


DAVE 


[ [Xref av ek Dae. 2) 


He also proved that if jz is a doubling measure and L? and L?(j) have (in a precise 
sense) the same almost-orthogonal systems, then jz must be Ag. 

Let us explain what this “precise sense” is. 

Ifz = (t,y) € R4t! = R¢ x (0,00) and f : R’ > C, we define f,(x) to 
be f((« — t)/y). This is the function f dilated and translated relative to the ball 
B(t; y), but without any measure-based normalization. If 0 < @ < 1 we say that 
@ € Cy if ¢ : R4 > C has support contained in B(0; 1) and, for all x and x’ in R¢, 
d(x) -—$(’)| < |x—2’|*. We write C, 9 to mean the subspace of @’s in C, satisfying 
f ¢dx = 0. We call a cube Q dyadic if OQ = [j,2*, (j, + 1)2*) x-+ +x [ja2*, (ja + 2") 
for some integers j,,... , jg, and k, and we write £(Q) for Q’s sidelength (which is 
2"), We call the set of all dyadic cubes D. If Q € D we put zg = (xg, £(Q)) € R4*', 
where xg is Q’s center. If {6p C Cy, then 


(Q) 
Pea | (3) 
10) ses 
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is a family of H6élder-smooth functions, indexed over D, with each one dilated, 
translated, and (Lebesgue) measure-normalized to “fit” a dyadic cube Q. If each 
2 € Cyo then it is easy to see that (3) is almost-orthogonal in L”, with an R (as 
in (2)) that only depends on a and d. If each # equals a fixed @ € Cyo (a “mother 
wavelet”) then (3) is sometimes called a wavelet system [2]. 

We could also consider the collection 


(Q) 
db: 


Vv L(Q) 


In [9] the author showed that, if 42 € Ago then, for every family {@@}p C Cg, the 
set (3) is almost-orthogonal in L? if and only if (4) is almost-orthogonal in L?(jz). 
He showed that this result has a partial converse: if jz is a doubling measure and it 
is the case that, for every {f')}p C Cy, the L* almost-orthogonality of (3) implies 
the L?(jz) almost-orthogonality of (4), then pz € Aoo. 

In a later paper [10] the author strengthened the converse. We define a T-sequence 
to be a function ¢ mapping from D into Rit! such that €(Q) € T(Q) for all O € D. 
In [10] the author proved that if yz is doubling, and ¢ is any non-trivial, real, radial 
function in Cy such that, for all T-sequences ¢, the family 


(4) 


QED 


$¢(0) 


Vv E(Q) 


is almost-orthogonal in L?(jz), then pt € Aco. 

The hypotheses that ¢ be real and radial are unnecessary. The “real” assumption 
is a computational convenience. The “radial” hypothesis (combined with non- 
triviality) simply ensures that d (the Fourier transform of @) does not vanish 
identically on any ray emanating from the origin. It turns out that smoothness and 
cancelation are also red herrings, at least for showing necessity of “ € Ago. In the 
current work we replace these hypotheses with a non-degeneracy condition that can 
be applied to subsets of L°°(B(0; 1)) (bounded functions with supports contained 
in B(O; 1)). This condition allows individual functions in the set to have Fourier 
transforms with bad directions. It only requires that no direction be bad for all 
of them. Precisely, we say that {¢;,}7 C L°°(B(0; 1)) satisfies the collective non- 
degeneracy condition (CNDC) if there is no ray from the origin on which every bx 
is identically 0. 

Our main result is: 


Theorem 1.1 Let yu be a doubling measure on R4 and let {dx}; C L©(B(O; 1) 
satisfy CNDC. If, for every 1 < k < nand every T-sequence 6, the set 


(5) 


QED 


(di)e(o) 
Vv H(Q) 


is almost-orthogonal in L?(1), then fb € Ago. 


(6) 


QED 
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The meaning of the theorem seems to be: If jz is doubling and L7(y) has a 
reasonable wavelet basis (one given by normalized translates/dilates of a finite set 
of mother wavelets), then jz must be Ago. 

The proof uses a slightly non-standard characterization of Ago; or, to be more 
precise, dyadic Ago. We recall that a measure v belongs to dyadic Ago (in symbols: 
ve Ad.) if, for every € > 0, there is ad > 0 so that, for all dyadic cubes Q and 
all measurable E C Q, |E|/|Q| < 6 implies v(E) < €v(Q). Obviously Ago C A%. 
It is not hard to show that if v € A% and v is doubling then v € Ago. To prove 
Theorem 1.1, it suffices to show that its hypotheses imply  € A. 

We will call {cg}p, a sequence of non-negative numbers indexed over D, a 
Carleson sequence if, for all Q’ € D, 


Y= colQ| < |0'l. (7) 
oo’ 


This is the same as saying that, for every Q’ € D, 


[ Y= coxe | ax < I'l. 


QED 
aca’ 


In section “The One-Dimensional, Dyadic Case” we show that v € A%, if and only 
if there is a finite R so that, for all Carleson sequences {cg}p and all Q’ € D, 


Y= cov(Q) < Rv(Q’): (8) 
QED 
ecg’ 


which, the reader will note, is the same as 


[ > coxo | dv < Rv(Q’). 


QED 
oc’ 


We prove Theorem 1.1 by showing that, given its hypotheses, jz must satisfy (8), 
for some fixed R, for all Q’ € D and all Carleson sequences. 

Aside from some technical lemmas, the proof turns on a simple observation. 
Suppose that (Q, M, v) is a measure space, and f : Q — C satisfies 


[tase lf|dv < 00 (9) 
Q Q 


for some finite R. Then the Cauchy-Schwarz inequality implies 


/ Ifldv < Rv(Q). (10) 
Q 
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(We need the ‘< oo’ in (9): consider f(x) = 1/x on (0, 1) with Lebesgue measure.) 
In the proof of Theorem 1.1, Q will be a certain “nearly optimal” Q’ € D and f will 
essentially be a function of the form 


y coxa, 


QED 
aca’ 


with {cg}p a “nearly optimal” Carleson sequence, carefully defined to have the 
second inequality in (9). After some work, Theorem 1.1’s almost-orthogonality 
hypothesis will yield the first inequality in (9), giving us (10) (and (8)). 

What seems to be going on here is a sneaky version of the self-improving (“John- 
Nirenberg”) property of BMO. Recall that f € L}, (R®) is said to belong to BMO if 


loc 


1 
sup ff —folde = If lle <x. a 
ocrd |Q| Q 
Qacube 
where fg denotes al if 0 f dx, f’s average over Q. The John-Nirenberg theorem ([4], 
p. 144) states that there are postive constants c;(d) and c2(d) such that, if f ¢ BMO, 
then for all cubes Q and all numbers A > 0, 


lix€ Q: [F(@%) —fol > A$] S c1@ exp(—e2(@)A/Ifll)1QI- 


This implies that if (11) holds then 


1 
sup — 
ocrd |O| 


Qacube 


[vf avs cw 


for some C depending only on d. In other words, 


2 


1 7 1 
—— _ d. a = , 
sup 0] fv fol’ dx < C] sup ol fv fol dx 


ocr¢d ocr¢ 
Qacube Qacube 


“the L' norm controls the L? norm.” 

Because we will need it later, we recall that f € L},.(R”) is said to belong to 
dyadic BMO (“f € BMO,’) if the inequalty (11) holds when the supremum is 
taken over all dyadic cubes. We write the resulting (finite) supremum as |||. a. 
The analogous John-Nirenberg properties also hold for f € BMO ,, with the cubes 
now required to belong to D. 

In section “The One-Dimensional, Dyadic Case” we state and prove a dyadic 
version of our main result, hoping it will illuminate the main ideas in the proof of 
Theorem 1.1. 

In section “Technical Lemmas” we prove some technical lemmas. 
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In section “Proof of Theorem 1.1” we prove Theorem 1.1 and give, as a corollary, 
an application to wavelet representations of linear operators. 

Notations. If A and B are positive quantities depending on some parameters, we 
write ‘A ~ B’ (“A and B are comparable”) to mean that there are positive numbers 
c, and cz (“comparability constants”) so that 


cA < B< oA: (12) 


and, if cy and cp depend on parameters, they do not do so in a way that makes (12) 
trivial. We often use ‘C’ to denote a constant that might change from occurrence to 
occurrence; we will not always say how C changes or what it depends on. If F and 
F are sets, we write E C F to express E C F. 

We will refer to “finite linear sums” of the form Der Ay8y(x), where {Ay} is 
a set of numbers and {g,}p is a set of functions, both indexed over an infinite set 
(typically D). “Finite linear sum” will mean a sum in which all but finitely many of 
the 1,’s are 0. Similarly, a “finite sequence” {A,,}r indexed over I’ will be one in 
which all but finitely many A,’s are 0. 

We indicate the end of a proof with the symbol &. 


The One-Dimensional, Dyadic Case 


First we prove our characterization of AY (8) (see [7] and [11] for its original form). 


Lemma 2.1 A Radon measure 1 belongs to A“, if and only if there is a finite R so 
that (8) holds for all Carleson sequences {cg}p and all Q' € D. 


Proof of Lemma 2.1 Suppose that 4 € A4,. Then ju is absolutely continuous, and 
we can write du = vdx, withv € Ae Classical arguments (see [1]) show that 
v satisfies (1) with respect to dyadic cubes, for some p > 1. Let M,(-) denote the 
dyadic Hardy-Littlewood maximal operator: 


Malla) = sup 5, le(t)| dt. 


The L?-boundedness of M;,(-) and Hélder’s inequality imply, for any Q’ € D, 


] 1/p 
real ae Ma(xorv) dx < (sa J secon? a) 


1/p 
ola Lome) 


CK, 
|2'| 


v(x) dx; 
Q’ 
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1.€., 
; Ma(xorv) dx < Cv(Q’) 
Q’ 


for all Q’ € D. Now let {cg}p be a Carleson sequence. If Q’ € D then 


Y~ cov(Q) = Y> coll (j 5) £ [ Maxtor) dx, 


QacQ’ ecg’ 


by standard tent-space arguments (see, e.g., Theorem 2 on page 59 of [4]). Therefore 
be AS implies (8). 


Suppose (8) holds. First we will show that jz is absolutely continuous with respect 
to Lebesgue measure. Then we will finish the lemma’s proof. 

Suppose E is measurable, |E| = 0 and, without loss of generality, E C Qy € D. 
Cover E with countably many disjoint cubes Q| C Qo such that 


2il2il s (1/2)|Qol. 


Now, having chosen the cubes {J)}), let {0 4,3; be a family of disjoint dyadic 
cubes such that: a) E C Uj Ox ; b) each Or, is a subset of some Q}; c) for all 0), 


> Qail < C/21Q4. (13) 
Of 4 CO) 
We can do this for all k because |E| = 0. Inequality (13) implies that, for any Q € D, 


y= 12h) < 2101. (14) 


O,.co 


We give the quick (and well known) proof of (14). By induction, for any oy and any 
n= 0, 


> nl < 2104 


7 ' 
Qn Cc OQ 
which implies that 


Y> Ohl < 210% 


a,CO, 
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for every QO). If Q is arbitrary let oe heuer the maximal O)’s contained in Q. The 


ak 
cubes On are disjoint. Therefore 


YL iM=>> Y lat s2>d Ohl <2Ig1, 


; ok [Ok ok pe fOk 
aco PH" FCO mn 


proving (14). 
Define: 


pa {l/2 if Oe {Opa 


0 otherwise. 


Inequalities (13) and (14) imply that {cg}-p is Carleson. Therefore there is a finite R 
such that 


Y“(1/2) (Qh) < Ru(Qo) < cv. 


jk 
But, because of a), for all N, 


N 


Nu(E) < ¥> Y° (OQ) < 2Ru(Qo), 


k=1 j 


forcing w(E) = 0. 

The rest of the proof that 4p € A%, is like what we just saw, only more careful. 
Let Qo € D, E C Qo, and |E|/|Qo| < n << 1. Fork > 1, let {Oj be the maximal 
dyadic subcubes of Qo such that 


JEN ol S 24+ Dk 
|Q;.| 


These are the Calderén-Zygmund cubes, taken at “height” 24+Dkn, of yz relative 
to Qo. Because of their maximality, for each om 


JEN Q| 
fe 


< 249+ ky = (1/2)24+ DED p 


which implies that every cube of 4, is contained in some 0), and that, for every 0), 


> -iGal Sac). 
0460, 
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which is the condition (13) we saw earlier. The same reasoning as before implies 
that, for all O € D, 


» Ig =2)0). 


oico 


Almost every point of E is a point of density. Therefore we will keep getting 
cubes Q), as long as 2¢*)*y is less than 1: there is a Ko ~ log(1/7) such that, for 


all 1 <k < Ko, |IE\ Uji] = = 0, and hence (E \ U;Q),) = = 0. (The union UjQ), 
“almost contains” E.) Define: 


1/2 if O€ {ON je 


0 otherwise. 


The sequence {cg}p is Carleson; therefore 


Y= cou(Q) < Ru(Qo). 


QCO 


But 


Y= cou(Q) = (1/2) 2H} ) = (1/2) s D> w(Q) = (1/2)Ko(E). 


QCO k=l j 


because, for each k < Ko, the part of EF outside UO), has j1-measure 0. Thus, 


w(E) < 1), 
Ko 


and 2R/Ky > 0asn > 07: € AX. & 
If 7 = [j2*, G+1)2*) C Risa dyadic interval, define J+ = [2j2*!, (2+1)2'!) 
(I’s left half) and J~ = [(2j + 1)2*"!, (2j + 2)2*—') (/’s right half), and set 


hw = Xr+ — xXr-- 


The functions {hi /|I|'/?};ep are known as the Haar functions, which comprise an 
orthonormal basis for L?(R). 


444 M. Wilson 


The dyadic analogue of Theorem 1.1 is 


Theorem 2.2 Let yt be a non-trivial Radon measure on R. If 


ha 


sn 15 
WAT) sa 


IED 


is almost-orthogonal in L?(t) then ju € A%. 


Proof of Theorem 2.2. The reader might want to look back at (9) and (10). 

Fix Il) € DandO < € << €(Ip). Let F(Io,€) be the familiy of Carleson 
sequences {c;}p such that cr = Oif J ¢ Jp or (J) < e. By compactness, there 
is a Carleson sequence {¢;}p € F (Jo, €) such that 


ra) = sup | wa) : {ci}p € Fas.ol < 00. 


D D 


Call the supremum L. Define 


P(x) = Yim) - (x ait) a 
2 D ol 


Notice that, because {C;}p is Carleson, 


1 ( . 
i (dan) = 
ol \S 


The function f is supported on Jp and satisfies { f dx = 0. Also, f belongs to BMO,, 
with ||f||«.a < 2. Let us prove this fact. Take J € D. If JN Ip = O we have nothing 
to prove. If Jo C J then f; = 0 and 


[v-nia <2 0 Gl < 2ldol < 2IJI. 
J D 
If J C Jp then 


[-macs2 Yen sr, 
J 


TeDiUICI 


By the John-Nirenberg theorem, there exists an absolute constant—which we call 
C—so that, for all J € D, 
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lf, hen) P 
[v-nta= > ae = (16) 
d IeD: ICJ 
where (-,-) denotes the usual (Lebesgue) L” inner product. Because of how we 


defined f, the inner products (f, iin) = Oif 1 Z Jy or LW) < e. Therefore the 
sequence defined by 


_ ho)? 
UP 


is a bounded multiple of a sequence from F (/o, €), implying 
’ h 7 
5 hall wan < ct, 


with C an absolute constant. 
We can write 


nae 
where 
i, 


= (fh) 
The L*(,1) almost-orthogonality of (15) implies that 
KL) 
[Pan < RD InP = RD mo Pe 
D 


h 
=e () 


< RCL. 
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But 
L= au) = [ot co) dp, 
D fo 
where 
alls 
C= Cy = 1. 
ol & 
Therefore 


/ If? du < RC (/ Ifldy + wh). 


which implies 


/ du < C'wlh), 


and 


an <c" ( / Ifldue + wh) < Cullo). (17) 


D 


The sequence {c;}p is optimal for sequences from F (Jo, €). Therefore (17) holds for 
every sequence in F (Jo, €). But the bound holds independent of Jo and ¢; therefore, 
by an obvious limiting argument, it holds for all Carleson sequences {c;}p. By 
Lemma 2.1, the measure jz belongs to A4.. & 


Remark We ask the reader to note how, in the interaction between (16) and (17), 
the John-Nirenberg theorem lets us bound an L? norm by an L! norm—which is the 
heart of the proof. 


Technical Lemmas 


The first lemma in this section says that, if every family of the form (6) is almost- 
orthogonal in L?(j), then these families must be, in an obvious sense, uniformly 
almost-orthogonal. 


Lemma 3.1 Let y € L™®(B(0; 1)). Suppose that, for every T-sequence €, there is a 
finite R = R(€, 4, W) such that, for all finite linear sums 


OM 
Xo 
d Ae Vu(Q) 
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we have 


Veo) 


mvsTICe) 


Then there is a finite R = R(1, v) such that (18) holds for all T-sequences €. 


2 
du < RY Ag). (18) 
D 


Proof of Lemma 3.1 For every T-sequence ¢, we can define a linear map L; : 
02(D) > L2(2) by 


L¢({Ag}D) = D0 Tree (19) 


Inequality (18) shows that the series in (19) converges unconditionally to an 
f € L’(w), and that f |f|/?du < R>°p|Aol|’. By the Uniform Boundedness 
Principle, if no universal R exists, then there is a sequence {Ag}p € €7(D) such 
that 5°, |Ao|? < 1, and there is a sequence of T-sequences ¢;, such that 


2 
Va@Q hike: (20) 


oro 


We will patch together a T-sequence c such that { Jip} is not almost-orthogonal. 


Fix the sequence {Ag}p. If F C D is finite, there is an N = N(F) such that 


2 


{| oe 


OQEF 


for all T-sequences ¢. Thus, because of (20), we know that, if Fy) C D is finite and 
R is any large number, there is a finite subset 7, C D, disjoint from Fo, and there 
is a T-sequence ¢,, such that 


2 


[|X we du>R 


QEF; 


Let Ry > oo. Let F; C D be a finite subset and ¢, a T-sequence such that 


2 


Yao) 
Ago d Ri. 
(Ap On 


QEF\ 
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Having defined F,, F2,... , Fn, let F,4,; C D bea finite subset disjoint from U}F,, 
and ¢,,4 1 a T-sequence such that 


2 


Vong 
i ye Ao— dit > Rn+i- 
QEF y4+1 H(Q) 


Define €: D > R4*! by 


Hoye i if Qe Fi 
ZO if O € UF x. 


Then E is a T-sequence for which (18) fails. & 


The proof of Theorem 1.1 uses a general form of the Calderon reproducing 
formula. Our approach is based on ideas and methods of Frazier, Jawerth, and 
Weiss [3]. We gratefully acknowledge their influence and inspiration. 

Recall that if w € Cy.o is real, radial, non-trivial, and normalized so that 


[ woHro ae 


for all & 4 0, then, if f € igsdesl? (R°), we have 
- = dt dy 
£0) = fF * Won) Hoye —9 
RY y 


in various senses [8, 11]. To be consistent with the notation in the introduction, we 
have written “yA Woy” in place of the more traditional “y,”. We will continue to 
follow this convention. 

We define ®(x) to be the inverse Fourier transform of exp(—|§ |? — |g| =) 
We notice that ® belongs to the Schwartz class S, and that OE ) and all of ®’s 
derivatives vanish to infinite order at the origin. 

It is important that &(E) > 0 on all of R@ \ {0}. 


Lemma 3.2 Suppose that {¢,}7 C L©(B(0; 1)) satisfies CNDC. For & € R@ \ {0} 
define 


oo n ae d 
G(é) = / B(yE) (> on?) - (21) 
1 


The function G(€) is infinitely differentiable on R“ \ {0} and homogeneous of degree 
0: G(t&) = G(§) for all t > 0. There are positive numbers c, and cz such that 
c) < GE) < c@ forallE £0. 


The Necessity of Ago for Translation and Scale Invariant Almost-Orthogonality 449 


Proof of lemma. The homogeneity is obvious. Every dx i is infinitely differentiable, 
and DY“ hy, € L® for every k and multi-index a. The function @ is also infinitely 
differentiable, and, for all a, D°® vanishes rapidly at 0 and infinity. These imply 
that G is infinitely differentiable. The CNDC implies that G(&) never vanishes on 
S¢-! = {€ : |&| = 1}. The smoothness of G and the compactness of S“~! imply 
that G lies between two positive constants there, hence on all of R@ \ {0}. & 


Now, given {d¢x}7 C L°(B(0; 1)) satisfying CNDC, and G as defined by (21), 
we set 


m(é) = ae (22) 


for & # 0, and undefined at the origin. By standard arguments ((4], p. 26), the 
Fourier multiplier operators given by 


Tof (E) = GE) 
and 
Tf (€) = m(EfE), 
initially defined for f € C°(R“), extend to bounded operators on L?(R“) for every 


1 < p < &. On these domains they are inverses of each other: TgT, = TnTg = I, 
the identity. 


For each x, define d(x) = ,(—x), and recall that b(€) = dx(E). If f € 17(R%) 
then 


is 7 a" 2 dt dy 
Tof = Y foul *¥ “Bey * OP 0yO) “Pad oy) & — 1) — 
1 Ry 
where we interpret each integral as 


R ee d 
tim f (634205 #0090) OMDa.yoe— Har) &. 
€ Rd y 


with the limit existing in L*. As we shall see, if f € CS°(R%), the limit also exists 
pointwise in x, with the integral being, in a natural sense, absolutely convergent. 
Because T,, and Tg are inverses of each other, if f € Cf° (R’), 


. , dtd 
ie Load * Tn “Poy) * “bid 0y)O) O “KO — 1) = 
i 7 RE 
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where the integrals converge (in the above sense) in L’. Let us define 
V(x) = Tn()Q). 


With this notation, we can rewrite the preceding integral formula as 
” 7 a _ dt dy 
= Dy *¥ Woy * OPO) O “Pd Oy) 0-1 a 
1 + 


(We have used the dilation-invariance of T,,,.) 
A look at W’s Fourier transform shows that Y € S and f W dx = 0. The same 
are true of Y;,, which we define as 


V(x) = VY * G(x). 


With this convention we can compress our integral formula to 
” _ = dt dy 
F=D fa F*enOM OM M0ne-9 22,3) 
1 °Ry 


We now prove two lemmas relating to (23). 


Lemma 3.3 Suppose that € S, fT dx = 0, and y € L™®(B(0;1)). There is a 
C = C(I, y) such that, if f € C&°(R“) satisfies |Vf| < A pointwise and B is any 
positive number, then 


B 
> = dy 
[ (fle +9Pon 0 e6= 0) at) & = cas, 
0 R¢ y 
Remark In our applications of Lemma 3.3, T = Wy, y = dx, and AB ~ 1. 


Proof of Lemma 3.3 The function I satisfies 


[P(x)| < CU + |x|)-4? 
[Vr@| < Cd + [a4 


[re dx = 0, 


for a fixed constant C. A lemma of Uchiyama [6] says that we can decompose T° 
into a rapidly converging sum of dilates of smooth, compactly supported functions, 
with integrals equal to 0. Precisely: 


lo. e) 
Ta) =C) 279 (F)on@), 
j=0 
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for an appropriate C, where each F; has support contained in B(0; 1) and satisfies 


IIFilloo $C 


[Fia=o. 


(Uchiyama’s lemma actually yields ||VFj||oo < C, but we don’t need that.) The 
function (F;)(o,2), has support contained in B(0; 2/) and the function ((F; 1) (0.2/)) Oy) 
has support contained in B(0; 2/y). The smoothness of f and the cancelation in F; 
imply that 
lf * y “(Hono < CA2ylly “(Foo lh 
< CAQy24 = CAWETDy 


for any ft, and therefore 


lo.) 
fay“ Loy Ol = CA >. 2 yey 
j=0 


= CAy. 


Since |ly||1 < C(y), 
ie lf xy “Toy @)¥ “voy — | dt < CAy, 
implying 


B : ? ’ 
i. ae |(f * y “Toy@) y “yoy — 1) | a) = < y (Cay) 


= CAB, 


proving the lemma. & 
The next lemma uses a standard definition and one derived from it. 


se : Pe d+1 
Definition 3.4 If Q C R¢ is a cube then we set 0 = Q x (0,4(Q)) C ra 
(sometimes called the “Carleson box” above Q) and R(Q) = {(t,y) € RQ” : 
d((t,y),Q) = €(Q)}, where d(-,-) denotes the usual Euclidean distance to a set 
in RG* 

+ 


Lemma 3.5 Let € S and y € L®(B(0; 1)). There is constant C = C(T, y) such 
that if f € L'!(R®) and the support of f is contained in a cube Q then 
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= a dt dy C 
i eTond)“vane-)| 22 <= / Ifldt 
R(Q) y |O| 


forallx €@Q. 


Proof of Lemma 3.5. For j = 0,1,2..., define R\(Q) = {(t,y) € RQ) 
2i€(O) < d((t,y),O) < 2/*1€(Q)}, and observe that R(Q) = Us°Ri(Q). Since y 
has its support contained in B(0; 1), yo;y\@—-—1) = vee) can be non-zero only 
if |x — t] < y. Therefore there is a positive c = c(d) such that, if x € Q and 
(t,y) € R\(Q), yo.) (x—2) will be zero unless y > c2/£(Q). If y > c2/€(Q), Hélder’s 
inequality implies 


If «Toy Ol < CLO Iflh 
and 
[le #*Ton09on- 9] d= CRON. 
If (t, y) € Rj(Q) then y < 2/+2€(Q). Therefore: 
Sao (6 *¥¥ “Toy O) ¥ “Voy & — 1) oo 


Hh 
2c. (fee |(F *¥ “To OY“ Voy &— | at) 2 


< C(O)“ lh. 


Summing over j finishes the proof. & 


Proof of Theorem 1.1. 


For the rest of this section, jz will be a fixed doubling measure. 
The proof of Theorem 1.1 works by rewriting each of the n summands in (23) as 
an average of sums of the form 


ie (eo 

TO) 
where ¢ is a acueaee We now describe how this rewriting will Be. IfQ= 
[2 Gi + + 1)2*) XxX ia2*, Ga + 1)2*) € D we set 9 = — (j12 a ae sng qa ), 
the “left-most corner” of Q. Define Vo = [0, 1)“, the “unit” dyadic cube. If QED, 
we define a bijective mapping o(Q,-,-) : T(Vo) > T(Q) by 


The Necessity of Ago for Translation and Scale Invariant Almost-Orthogonality 453 


a(Q,t. 1) = (tg + €(Q)t, €(Q)n). 


We point out some properties of this mapping. If g : T(Q) — C is measurable we 
can define h : T(Vo) > C by h(t, 7) = g(o(Q,t, 7)). By the change-of-variables 


formula, 
d 
I. att,y) <2 = jal f a (24) 
T(Q) 


dtd 
i (fF xy 400) yoo —) 22 
T(Q) y 


We can write 


as 
= ———— dt dy 
i y EF, Gdeesy) Grd (eg) —. 
TQ) y 
where (-,-) is the ordinary L? inner product. Because of (24), this is equal to 
[2 Srevay(€(Q)n) 4 (f, Padocorm) po(O.em) 
>] lg|-" Srv) eee (Yi)oo.r.n)) (Pi)o(0,r.n) (x) an 
Therefore, we can formally rewrite the integral in (23) as: 
Lo Sry AF. Wey) Gday@) 
= fr) (Sa lO F. (Ye)o@.r.m) (Pe)o@.c.0) (x)) ype ee (25) 


Of course, if the summation only runs over a finite set of Q’s (as it will for us), the 
equality is literal. 
In proving Theorem 1.1, it will be more convenient to write (25) as 


/ af (lor'y. (Pox) VO) eae no ena. 
T(Vo) “Dp 


Vv H(Q) 1 


Proof of Theorem 1.1 We fix, once and for all, a function b € Ce°(R“) that is non- 
negative, has support contained in B(0; 1/2), and satisfies f bdx = 1. Recall our 
definition of zg = (xg, £(Q)), where xg is Q’s center and £(Q) is Q’s sidelength. If 
Q C R‘ is any cube then bz, is supported in Q and satisfies f bz ax = |Q|. If v is 
any doubling measure then 


/ bp dv ~ v0), (26) 
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with comparability constants depending on b and v. If Qo € D and 2/ << 1, we 
define F(Qo, 2’) to be the family of Carleson sequences {cg}p such that cg = 0 if 
O ¢ Qo or £(Q) < 2/£(Qo). It is clear that the set of numbers 


1(Qo)' > cop(Q) : Qo € D, {co}p € F(Qo,2/) (27) 


D 


is bounded above by | + |j|. Call the actual supremum L(/). Theorem 1.1 will follow 
once we show that sup; L(j) < oo. 


Fix j. There exist a Qo € D and a Carleson sequence {Zg}p € F(Qo,2/) such 
that 


1(Qo)' ¥ > Zon(Q) = (1/2)L(). 
D 


Fix Qo and {Cg}. Theorem 1.1 will follow if we show that (Qo)! )>5 Co(Q) is 
bounded by a number independent of Qo and j. 


Define 
FQ) = D1 Fobeo (x). 
PD 
Because of (26), 
[fau~ Deon) ~ LOW). (28) 
D 


As with Theorem 2.2, the “game” now is to show that 


/ fPdw<c if fl dy, (29) 


for some C < oo independent of Qo and j; because, as we have seen, the Cauchy- 
Schwarz inequality will imply 


/ If|dw < Cu(Qo); 
which, with (28), will yield 
Lj) < C, 


for some absolute C independent of Qo and j. 
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Because of (28), (29) will follow from 
[Pau = cLpucoo) 


It is obvious that f is supported inside Qp and satisfies f |f| dx < |Qo|. It will be 
important to us that f € BMO, with a BMO norm bounded by a constant depending 
only on b and d; so let us prove this. Write f = )°,f;, where 


fi) = D> Sodio (x). 
Q: £(Q)=2 


Each f; is infinitely differentiable and satisfies: (i) |lfxlloo < 1; and (ii) ||Viclloo < 
C2-*, We note that inequality (ii) implies |Vf| < C(2/€(Qo))~! pointwise. 
Let Q’ be a cube and write 


f= ye Sk + a fe = Fit Fo. 


k: 2k>€(Q’) k: 2k <£(Q’) 
We can cover Q’ with C(d) congruent dyadic cubes {QF ie such that (1/2)£(Q’) < 


€(Q*) < £(Q’), which implies that, if Q ¢ D and €(Q) < £(Q’), then £(Q) < £(Q7) 
for every j; hence, if QM Q’ # @ then Q Cc Q* for some j. Then: 


[ |FaQ@)|dx = [ S> Zgbeg(x) | dx 


" \ 000) <4) 


< / : > Cobzg(x) | dx 


jai 0:0c0* 


J=1 @:0Cc 0* 


C(d) 


<)> a" 
1 
< CQ’. 


On the other hand, | VF) (x)| < C/€(Q’), implying that 
[rie -@olars cel. 
Q’ 


Therefore f belongs to BMO, with a norm < C. 
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We invoke a standard fact about BMO ([4], p. 159): If h ¢ BMO,T € S, and 
fT dx = 0, then, for all cubes Q C R4, 


1 _ dt dy 
a flier Ton or < chal. 
|O| 0 y 


where the constant C only depends on I. This implies that, for h € BMO, the 
sequence of numbers {cg}p defined by 


1 _ dt dy 
to= = | Jn «xy “Toy? — 
|Q| J rio) y 


is a bounded multiple of a Carleson sequence. 
We can write f = g1 + g2 + g3 + ga, where 


a= > | 
82(x) = >| 


n 


g(x) = >> 
1 


_ 7 dt dy 
Ff *y¥ “Woy (O) ¥ “(be) 0.) & — 2) — 
(ty): y<I-(Qo)} y 


_ = dt dy 
f *y “(VO (D) ¥ “(Px) Oy) & — 2) — 
(ty): 21 (Qo)<y<l(Qo)} » 


_ 7 dt dy 
i" GF ist Gone 5 
{(t.y): €(Qo0)<y<3(Qo)} y 


): y>3£(Qo)} 


n d d 
g4(x) = ya F *¥ “Don OY “Godoy &-1 idee 
1 {(ty y 


Lemmas 3.3 and 3.5 imply that the integrals on the right-hand sides all converge 
absolutely. By Lemma 3.5, g4 is pointwise bounded by C|Qo|7' f |f|dx < C for 
x € Qo, and it is easy to see that the same bound holds for g3. Since f € C&°(R“) 
and |Vf| < C(2/€(Qo))~! pointwise, Lemma 3.3 implies that |g;| is bounded by 
an absolute constant in Qo. Thus, for x € Qo, we may write f = gz + G, where 
|G| < C, and C does not depend on Qp or j. 

By Lemma 3.1, there is an R such that, for every 1 < k <n, every T-sequence ¢, 
and every finite sequence {Ag}p C C, 


2 
(Px )e(o) 2 
Ao— =| dusR) ddl’. 
/ dX Vv H(Q) dX 
We claim that 
[ leal? dpe < CRL() (Qo) (30) 


for a constant C depending on y and d, but not on Qp or j. Since Oo IG? du < 
CyL(Qo), proving (30) will finish the proof. 
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__ There exist N = MN(d) dyadic cubes {Q; ", congruent to Qo, such that 
Q;M Qo # @. If x € Qo then the support restriction on the ¢,’s implies that 


n 
= = dt dy 
g(x) = >> ae Fy “(Yo O) ¥ “(Pd 03) @—-1) —. 
fa 2 Alby): EY Oi, Y-!(Qo) <y<l(Oo)} » 


For each 0 < i < Nand 1 <k <n, define 


- = dt dy 
ore / | (Fey (U oO) ¥“Gdoy@—) 22. 
{(ty): t€Q;, 2! €(Q0)<y<l(Qo)} y 


Inequality (30) will follow once we show 


ij Iviel? du < CRLG) (Qi), (31) 


because jz’s doubling property implies w(Q;) < Ci(Qo). 
For 0 < i < N, we define F; to be the (finite!) family of dyadic subcubes Q of Q; 
such that 2/£(Q;) < £(Q) < £(Q;). We can then write: 


dtd 
vices) = >> fey “(WI oyOY “Gvoy@-1 sade 
oeF, ° TQ) y 


We rewrite the last equation as 


Vik) = im > 


oD OEF; 


o(Q,t _», dtd 
(ion (nan) Vit) Pee a), eae) 


Vv H(Q) 7 


For each (t, 7) € T(Vo), 


2 
7 (Pi )o(0,1.n) (&) 
: ’ wU a(Q,T, a d 
/ zl (f, Widoo ») ¥e()| =a ul 


is less than or equal to R times 


2 
D [lor WrexmrVa@] = Yo (EAP eeenll) po), 


2 
QeF; QeF; IOI 


Thus, by the generalized Minkowski inequality, 


1/2 
: Ddooun)? dtdn 
ikl ) < RV? / IF, Wedo.em)l —2g dtd 
(/ lyin? de a (= ( oP HQ) a — 
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But (T(Vo), 74 on) is a finite measure space (with a total measure only 
depending on d); therefore, 


ee 1/2 
l(f, (Voorn)! ~2d dt dy 
hs pe ( |Q| ae, n 


is less than or equal to a dimensional constant times 


1/2 
\(f, (Wdoo,r.n) iF —2d dt dn - 
es ( |O|? Ko) 4 UT 


which implies that 


‘sient _4, dtd 
JiraP an = CR » I. | (Ceeegsall Ko) n 2d - 0 


OEF; 


lf, Porm)? \ oa dt dn 
= CR If. Gide nm)? dedn | 
2, (/., ( |O|? 7 n H(Q) 


But, for each Q € F;, by the change of variables formula (24), 


If, Wdowarm)? \ 2g at dn - = dt dy 
/ ee | 4 — = JK! / If ey “(Ho OP? —: 
T(Vo) |Q| 1 T(0) y 


and, because f € BMO, with ||f||4 < C, the sequence defined by 
= = dt dy 
cielo [rey toyior 

T(Q) y 


is a bounded multiple of a Carleson sequence. By our definition of L(j), 


S> cois(Q) < CL()W(Q;) 


QEF; 


(because all of the Q’s occurring in the sum satisfy £(Q) > 2/£(Q,) and are contained 
in Q;). Therefore 
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[inten sce (ior [ retovonorS a2) wo) 


OEF; 


= CRY) coin(Q) 


QEF; 
< CRL(/) (Qi), 


finishing the proof of Theorem 1.1. & 


We present an easy corollary of Theorem 1.1. We first note that, by duality, if 
{wWite C L?(v) satisfies (2), then, for all f € L?(v), 


Yoh vedo? S x | If? dv (32) 
k 


(where we use (-,-), to denote the inner product in L7(v)); and, conversely, if 
{Wit C L?(v) satisfies (32), it satisfies (2). 
In [9] the author looked at linear operators of the form 


(f, yo i. 
» So) Peo (33) 


for a doubling measure v, sequences of functions {yp and {6p in Cy, and 
T-sequences ¢ and ¢’. One can think of (33) as a simple model for a wavelet 
representation of a Calderén-Zygmund singular integral operator (see [5] and 
references cited there). By Littlewood-Paley theory, if the w'2’s and 4s lie 
in Cy and v € Ago then (33) defines a bounded linear operator on L?(v) in the 


following sense: If F#; C F2 C #3 C --- is any increasing sequence of finite 
subsets of D such that D = U;F; then, for all f € L7(v), 
py) 
F. Ve)» 
T(F)(x) = Jim ges 50) Peo) (X) (34) 
Oe 


exists in L?(v) and ||T(f)||22«) < Cv, @)|Ifllz2q)-' We present a partial converse: 


Corollary 4.1 Suppose that 1 is doubling. Let {4} C L°(B(0; 1)) satisfy CNDC 
and suppose that, for each 1 < k <nand each T-sequence ¢, the series 


» is Eis * (bne(), (35) 
D 


defined as in (34), yields an L?(1) bounded linear operator. Then t € Ago 


'This also holds in L?(v), 1 < p < 00, and the cancelation hypotheses can be weakened [9]. 
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Proof Call the operator defined by (35) T. If T is L?(w) bounded then 
(TMF aul < Cf fF du for all f € L(y). But 


2 lf, (bde@)ul? 
(Pfs oe 
: dX L(Q) 


Therefore, by the converse to (32), (6) is almost-orthogonal in L?(). QED. & 


Remark We believe the most natural application of Corollary 4.1 is this. Let 
w € Cyo be real, radial, and non-trivial. If jz is doubling and the series 


S LE teas Yo 


(with the sum defined as above) gives an L?(j1) bounded operator for every 
T-sequence ¢, then jt € Ago 


Acknowledgements We are grateful to the referee for spotting a gap in the proof of Lemma 2.1 
and for valuable suggestions on improving the paper’s exposition. 
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